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DESCRIPTION 



Method Of Diagnosing Breast Cancer 



This application claims the benefit of U.S. Provisional Application Serial 
No.60/505,571 filed September 24, 2003, the contents of which are hereby incorporated by 
reference in its entirety. 



Breast cancer is a complex disease characterized by the numerous genetic and 
epigenetic changes in a large number of genes (Katherine N. N., Richard W. and Barbara L. 
W. Breast cancer genetics: What we know and what we need. Nat Med, 7(5): 552 - 556, 
2001.). Little is known about whether these abnormalities are the cause of breast 
tumorigenesis, although it has been reported to occur by a multi-step process which can be 
broadly equated to transformation of normal cells, including the stages of atypical ductal 
hyperplasia, ductal carcinoma in situ (DCIS) and invasive ductal carcinoma (IDC). Although 
the stages of mammary carcinogenesis are similar to those in otiier tissues, the precise 
molecular mechanisms driving breast cancer remain unknown. In any event, it is readily 
apparent that molecular factors leading to development of primary breast cancer, its 
progression, and its metastasis would be valuable targets for the development of better tools 
for early diagnosis, treatment, and prevention of this disease. 

There is evidence that only a portion of premalignant lesions are committed to 
progression to invasive cancer while the other lesions undergo spontaneous regression. This 
explanation of molecular participation, which'leads to development of primary breast cancer, 
its progression, and its formation of metastases, is the main focus for new strategies targeted 
at prevention and treatment. 

Gene-expression profiles generated by cDNA microarray analysis can provide 
considerably more detail about the nature of individual cancers than traditional 
histopathological methods are able to supply. The promise of such information lies in its 
potential for improving clinical strategies for treating neoplastic diseases and developing 
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novel drags (Petricoin, E. F., 3rd, Hackeft, J. L., Lesko, L. J., Puri, R. K., Gutman, S. L, 
Chumakov, K., Woodcock, J., Feigal, D. W., Jr., Zoon, K. C, and Sistare, F. D. Medical 
applications of microarray technologies: a regulatory science perspective. Nat Genet, 32 
Suppl: 474-479, 2002; Johannes B., Esther Z. and Axel U. Molecular targets for breast cancer 

5. therapy and prevention. Nat Med, 7 (5): 548 - 552, 2001.). With this goal in mind, the 
present inventors have analyzed the expression profiles of tumor or tumors fi^om various 
tissues by cDNA microarrays (Okabe, H.et al.. Genome-wide analysis of gene expression in 
human hepatocellular carcmomas using cDNA microarray: identification of genes involved in 
viral carcinogenesis and tumor progression. Cancer Res, 61: 2129-2137, 2001.; Hasegawa, 

10 S.et al.,Genome-wide analysis of gene expression in intestinal-type gastric cancers using a 
complementary DNA microarray representing 23,040 genes. Cancer Res, 62: 7012-7017, 
2002.; Kaneta, Y.et al,, and Ohno, R. Prediction of Sensitivity to STI571 among Chronic 
Myeloid Leukemia Patients by Genome-wide cDNA Microarray Analysis. Jpn J Cancer Res, 
93: 849-856, 2002.; Kaneta, Y.et al., Genome-wide analysis of gene-expression profiles in 

15 chronic myeloid leukemia cells using a cDNA microarray. Int J Oncol, 23: 68 1 -69 1 , 2003 .; 
Kitahara, O.et al.. Alterations of gene expression during colorectal carcinogenesis revealed by 
cDNA microarrays after laser-capture microdissection of tumor tissues and normal epithelia. 
Cancer Res, 61: 3544-3549, 2001.; Lin, Y. et al. Molecular diagnosis of colorectal tumors by 
expression profiles of 50 genes expressed differentially m adenomas and carcmomas. 

20 Oncogene, 21: 4120-4128, 2002.; Nagayama, S. et al., Genome-wide analysis of gene 
expression in synovial sarcomas using a cDNA microarray; Cancer Res, 62: 5859-5866, 
2002.; Okutsu, J. et al.. Prediction of chemosensitivity for patients with acute myeloid 
leukemia, according to expression levels of 28 genes selected by genome-wide 
complementary DNA microarray analysis. Mol Cancer Ther, 1: 1035-1042, 2002.; Kikuchi, T. 

25 et al.. Expression profiles of non-small cell lung cancers on cDNA microarrays: identification 
of genes for prediction of lymph-node metastasis and sensitivity to anti-cancer drugs. 
Oncogene, 22: 2192-2205, 2003.). 

Recent examination into the expression levels of thousands of genes through the use 
of cDNA microarrays have resulted in the discovery of distinct patterns in different types of 

30 breast cancer (Sgroi, D. C.et al.. In vivo gene expression profile analysis of human breast 
cancer progression. Cancer Res, 59: 5656-5661, 1999.; Sorlie, T.et al.. Gene expression 
patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc 
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Nad Acad Sci USA, 98: 10869-10874, 2001.; Kauraniemi, P.et al.. New amplified and 
highly expressed genes discovered in the ERBB2 amplicon in breast cancer by cDNA 
microarrays. Cancer Res, 61: 8235-8240, 2001.; Gruvberger, S.et al„ S. Estrogen. receptor 
status in breast cancer is associated with remarkably distinct gene expression patterns. Cancer 
5 Res, 61: 5979-5984, 2001.; Dressmaa, M. et al.. Gene expression profiling detects gene 
amplification and differentiates tumor types in breast cancer. Cancer Res, 63: 2194-2199, 
2003.). 

Studies into gene-expression profiles in breast cancers have resulted in the 
identification, of genes that may serve as candidates for diagnostic markers or prognosis 

10 profiles. However, these data, derived primarily firom tumor masses, cannot adequately reflect 
expressional changes during breast carcinogenesis, because breast cancer cells exist as a solid 
mass with a highly inflammatory reaction and containing various cellular components. 
Therefore, previously published microarray data is likely to reflect heterogenous profiles. 

Studies designed to reveal mechanisms of carcinogenesis have already facilitated the 

15 identification of molecular targets for certain anti-tumor agents. For example, inhibitors of 
famesyltransferase (FTIs) which were originally developed to inhibit the growth-signaling 
pathway related to Ras, whose activation depends on post-translational famesylation, have 
been shown to be effective in treating Ras-dependent tumors in animal models (He et al.. Cell 
99:335-45 (1999)). Similarly, clinical trials on humans using a combination of anti-cancer 

20 drugs and the anti-HER2 monoclonal antibody, trastuzumab, with the aim of antagonizing the 
proto-oncogene receptor HER2/neu have achieved improved clinical response and overall 
survival of breast-cancer patients (Lin et al.. Cancer Res 61:6345-9 (2001)). Finally, a 
tyrosine kmase inhibitor, STI-571, which selectively mactivates bcr-abl fiision proteins, has 
been developed to treat chronic myelogenous leukemias wherein constitutive activation of 

25 bcr-abl tyrosine kinase plays a crucial role in the transformation of leukocytes. Agents of 
these kinds are designed to suppress oncogenip activity of specific gene products (Fujita et al.. 
Cancer Res 61 :7722-6 (2001)). Accordingly, it is apparent that gene products commonly up- 
regulated in cancerous cells may serve as potential targets for developing novel anti-cancer 
agents. 

30 It has been fiorther demonstrated that CD8+ cytotoxic T lymphocytes (CTLs) 

recognize epitope peptides derived from tumor-associated antigens (TAAs) presented on the 
MHC Class I molecule, and lyse tumor cells. Since the discovery of the MAGE family as the 
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first example of TAAs, many other TAAs have beaa discovered using inunimological 
approaches (Boon, Int J Cancer 54: 177-80 (1993); Boon and van der Bruggen, J Exp Med 
183: 725-9 (1996); van der Bruggen et al.. Science 254: 1643-7 (1991); Brichard et al., J Exp 
Med 178: 489-95 (1993); Kawakami et al., J Exp Med 180: 347-52 (1994)). Some of the 

5 newly discovered TAAs are currently undergoing clinical development as targets of 

immunotherapy. TAAs discovered so far include MAGE (van der Bruggen et al., Science 
254: 1643-7 (1991)), gplOO (Kawakami et al., J Exp Med 180: 347-52 (1994)), SART 
(Shichijo et al., J Exp Med 1 87: 277-88 (1 998)), and NY-ESO-1 (Chen et al., Proc Natl Acad 
Sci USA 94: 1914-8 (1997)). On the other hand, gene products demonstrated to be 

10 specifically over-expressed in tumor cells have been shown to be recognized as targets 
inducing cellular immune responses. Such gene products include p53 (Umano et al., Brit J 
Cancer 84: 1052-7 (2001)), HER2/neu (Tanaka et al., Brit J Cancer 84: 94-9 (2001)), CEA 
(Nukaya et al., Int J Cancer 80: 92-7 (1999)), and so on. 

In spite of significant progress in basic and clinical research concerning TAAs 

15 (Rosenberg et al.. Nature Med 4: 321-7 (1998); Mukherji et al., Proc Nad Acad Sci USA 92: 
8078-82 (1995); Hu et al., Cancer Res 56: 2479-83 (1996)), only limited number of candidate 
TAAs for the treatment of adenocarcinomas, including colorectal cancer, are currently 
available. TAAs abundantly expressed in cancer cells yet whose expression is restricted to 
cancer cells would be promising candidates as immunotherapeutic targets. Further, 

20 identification of new TAAs inducing potent and specific anti-tumor immune responses is 
expected to encourage clinical use of peptide vaccination strategies for various types of 
cancer (Boon and can der Bruggai, J Exp Med 183: 725-9 (1996); van der Bruggen et al., 
Science 254: 1643-7 (1991); Brichard et al., J Exp Med 178: 489-95 (1993); Kawakami et 
al.,J Exp Med 180: 347-52 (1994); Shichijo et al., J Exp Med 187: 277-88 (1998); Chen et al., 

25 Proc Natl Acad Sci USA 94: 1914-8 (1997); Harris, J Natl Cancer Inst 88: 1442-5 (1996); 
Butterfield et al.. Cancer Res 59: 3134-42 (1999); Vissers et al., Cancer Res 59: 5554-9 
(1999); van der Burg et al., J Immunol 156: 3308-14 (1996); Tanaka et al., Cancer Res 57: 
4465-8 (1997); Fujie et al., Int J Cancer 80: 169-72 (1999); Kikuchi et al., hit J Cancer 81: 
459-66 (1999); Oiso et al., Int J Cancer 81: 387-94 (1999)). 

30 It has been repeatedly reported that peptide-stimulated peripheral blood mononuclear 

cells ^BMCs) fi"om certain healthy donors produce significant levels of IFN-a in response to 
the peptide, but rarely exert cytotoxicity against tumor cells in an HLA-A24 or -A0201 
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restricted manner in ^^Cr-release assays (Kawano et al., Cancer Res 60: 3550-8 (2000); 
Nishizaka et al., Cancer Res 60: 4830-7 (2000); Tamura et al., Jpn J Cancer Res 92: 762-7 
(2001)). However, both of HLA-A24 and HLA-A0201 are popular HLA alleles in the 
Japanese, as well as the Caucasian populations (Date et al.. Tissue Antigens 47: 93-101 

5 (1996); Kondo et al., J Immunol 155: 4307-12 (1995); Kubo et al., J Immunol 152: 3913-24 
(1994); Imanishi et al., Proceeding of the eleventh International Histocompatibility Workshop 
and Conference Oxford University Press, Oxford, 1065 (1992); Williams et al.. Tissue 
Antigen 49: 129 (1997)). Thus, antigenic peptides of carcinomas presented by these HLAs 
may be isspecially useful for the treatment of carcinomas among Japanese and Caucasians. 

10 Further, it is known that the induction of low-aJBBnity CTL in vitro usually results from the 
use of peptide at a high concentration, generating a high level of specific peptide/MHC 
complexes on antigen presenting cells (APCs), which will effectively activate these CTL 
(Alexander-Miller et al., Proc Natl Acad Sci USA 93: 4102-7 (1996)). 

Accordingly, in an ejSbrt to understand the carcinogenic mechanisms associated with 

15 cancer and identify potential targets for developing novel anti-cancer agents, large scale 
analyses of gene expression patterns in purified populations of breast cancer cells were 
performed using a cDNA microarray representing 23,040 genes. More particularly, using a 
combination of cDNA microarray and laser beam microdissection, precise genome-wide 
expression profiles of 81 breast tumors were examined, including 12 ductal carcinomas in situ 

20 (DCIS) and 69 invasive ductal carcinomas (BDC). Among the up-regulated genes 

identified, the following three genes were selected for being significantly over-expressed in 
breast cancer cells: A5657 (previously referred to as AF161499); B9769 (previously referred 
to as AA156269); and C7965 (previously referred to as AW977394). The findings discussed 
in detail herein suggest that these genes play key roles in tumor cell growth proliferation, and 

25 are therefore promising targets for development of anti-cancer drugs. 
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SUMAfARY OF THE INVENTION 

The present invention relates to the discovery of the following three genes whose 
expression was significantly up-regulated in 38 of 49, 30 of 73 and 28 of 49 breast cancer 
cases assayed, respectively: A5657 (SEQ ID NO: 1), encoding an HSPC150 protein (SEQ ID 

5 NO: 2) similar to ubiquitin-conjugating enzyme; B9769 (SEQ ID NO: 3), encoding 

hypothetical protein BC016861 (SEQ ID NO: 4); and C7965 (SEQ ID NO: 5), designated as 
LOC90557 (SEQ ID NO: 6). Subsequent semi-quantitative RT-PCR and Northern blot 
analysis confirmed that ^5557, B9769 and C7965 were significantly over-expressed in 
clinical breast cancer samples and breast cancer cell lines as compared to normal human 

10 tissues, including breast ductal cells and normal breast. In particular, B9769 was highly 

expressed in ERa positive breast cancer cell lines, hnmunocytochemical staining showed that 
exogenous A5657, B9769 and C7965 localized to the cytoplasmic and/or nucleus apparatus, 
the cytoplasmic, and the cytoplasmic apparatus, respectively, in breast cancer cell line, T47D 
cells. In particular, exogenous B9769 was observed in the intermediate filament network in 

15 C0S7 and T47D cells. Furthermore, the A5657 protein was shown to interact with ubiquitin 
by immunoprecipitation binding assay, which suggests that the A5657 protein potentially has 
E2 ubiquitin-enzyme activity. Treatment of breast cancer cells witit small interfering RNAs 
(siRNAs) effectively inhibited expression of A5657, B9769 and C7965 and suppressed 
cell/tumor growth of breast cancer, respectively. In addition, it was discovered that all three 

20 genes ,when transiently over-expressed in NIH3T3 cells, dramatically promoted cell 
proliferation in MTT assay, which suggests that they play key roles in cell growth 
proliferation. These findings suggest that over-expression of A5657. B9769 and C7965 may 
be involved in breast tumorigenesis and may be promising strategies for specific treatment for 
breast cancer patients. 

25 Accordingly, the present invention involves the discovery of that the expression of 

these three genes significantly correlates with breast cancer (BRC). These genes, 
differentially e3q)ressed in breast cancer, are collectively referred to herein as "BRC nucleic 
acids" or "BRC polynucleotides" and the corresponding encoded polypeptides are referred to 
as "BRC polypeptides" or "BRC proteins." 
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Accordingly, the present invention provides a method of diagnosing or determining a 
predisposition to breast cancer in a subject by determining an expression level of a BRC- 
associated gene in a patient-derived biological sample, such as tissue sample. The term 
"BRC-associated gene" refers to a gene that is characterized by an expression level which 

5 differs in a BRC cell as compared to a normal cell. A normal cell is one obtained from breast 
tissue. In the context of the present invention, a BRC-associated gene is one or more genes 
selected from the group consisting o^A5657 (SEQ ID NO: 1), B9769 (SEQ ID NO: 3) and 
C7965 (SEQ ID NO: 5). An alteration, e.g., an increase in the level of expression of a BRC- 
associated gene as compared to a normal control level of the gene, mdicates that the subject 

10 suffers from or is at risk of developmg BRC. 

In the context of the present invention, the phrase "control level" refers to a protein 
expression level detected in a control sample and includes both a normal control level and an 
breast cancer control level. A control level can be a single expression pattern derived from a 
smgle reference population or from a plurality of expression patterns. For example, tiie 

15 control level can be a database of expression patterns from previously tested cells. A 
"normal control level" refers to a level of gene expression detected in a normal, healthy 
individual or in a population of individuals known not to be suffering from breast cancer. A 
normal individual is one with no clinical symptoms of breast cancer. On the other hand, a 
"BRC control level" refers to an expression profile of BRC-associated genes found in a 

20 population suffering from BRC. 

An increase in the expression level of one or more BRC-associated genes, selected 
from the group consisting of A5657, B9769 and C7965, detected in a test sample as compared 
to a normal control level mdicates that the subject (from which the sample was obtained) 
suffers from or is at risk of developing BRC. 

25 Alternatively, expression of a panel of two or more BRC-associated genes in a sample 

can be compared to a BRC control level of the same panel of genes. A similarity between a 
sample expression and BRC control expression indicates that the subject (from which tiie 
sample was obtained) suffers from or is at risk of developuig BRC. 

According to the present mvention, gene expression is deemed "altered" when it is 

30 increased by at least 10%, at least 25%, at least 50% or more as compared to a control level. 
Alternately, gene expression may be deemed to be altered if it is increased at least 1, at least 2, 
at least 5 or more fold as compared to a control level. Gene expression may be determined 
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by detecting hybridization, e.g., on an array, of a BRC-associated gene probe to a gene 
transcript of the patient-derived tissue sample. 

In the context of the present invention, the patient-derived tissue sample can be any 
tissue obtained from a test subject, e,g,, a patient known to or suspected of having BRC. For 

5 example, the tissue may contains an epithelial cell. More particularly, the tissue may be an 
epithelial cell from a breast ductal carcinoma. 

The present invention also provides a BRC reference expression profile, comprising a 
gene expression level of two or more of BRC-associated genes selected from the group 
consisting of A5657, B9769, and C7965. 

10 The present invention forther provides methods of identifying an agent that inhibits 

the expression or activity of a BRC-associated gene, e.g. BRC-associated genes selected from 
the group consisting oiA5657, B9769 and C7965, by contactmg a test cell expressing a BRC- 
associated gene with a test compound and determining the expression level or activity of the 
BRC-associated gene. The test cell may be an epithelial cell, such as an epithelial cell 

15 obtained from a breast carcmoma. A decrease in the expression level or activity a BRC- 
associated gene or its gene product as compared to a normal control level or activity of the 
gene or gene product indicates that the test agent is an inhibitor of the BRC-associated gene 
and may be used to reduce a symptom of BRC, e.g. the expression of a BRC-associated gene 
selected from the group consisting oiA5657, B9769 and C7965. 

20 The present invention also provides a kit comprising a detection reagent which binds 

to one or more BRC nucleic acids or BRC polypeptides. Also provided is an array of nucleic 
acids that binds to one or more BRC nucleic acids. 

Therapeutic methods of the present invention include a method of treating or 
preventing BRC in a subject includmg the step of by administering to the subject an antisense 

25 composition. In the context of the present invention, the antisense composition reduces the 
expression of the specific target gene. For example, the antisense composition may contain a 
nucleotide which is complementary to a BRC-associated gene sequence selected from the 
group consisting of BRC-associated genes A5657, B9769 and C7965. Alternatively, the 
present method may include the steps of adirmiistering to a subject a short interfering KNA 

30 (siRNA) composition. In the context of the present invention, the siRNA composition 
reduces the expression of a BRC nucleic acid selected from the group consisting of BRC- 
associated genes A5657, B9769 and C7965 . In yet another method, the treatment or 
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prevention of BRC in a subject may be carried out by administering to a subject a ribozyme 
composition. In the context of the present invention, the nucleic acid-specific ribozyme 
composition reduces the expression of a BRC nucleic acid selected fix)m the group consisting 
of the BRC-associated genes A5657, B9769 and C7965. 

5 The present invention also includes vaccines aud vaccination methods. For example, a 

method of treating or preventing BRC in a subject may involve administering to the subject a 
vaccine containing a polypeptide encoded by a nucleic acid selected from the group 
consisting of the BRC-associated genes ^55J7, B9769 and C7P55or an unmunologically 
active fragment of such polypeptide. In the context of the present invention, an 

10 immunologically active fragment is a polypeptide that is shorter in length than the full-length 
naturally-occurring protein yet which induces an immune response analogous to that induced 
by the full-length protein. For example, an immunologically active fragment should be at 
least 8 residues in length and capable of stimulating an immune cell such as a T cell or a B 
cell. Immune cell stimulation can be measured by detecting cell proliferation, elaboration of 

15 cytokmes {e.g., IL-2), induction of cytotoxic T lymphocyte, or production of an antibody. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or eqxiivalent to those described herein can 

20 be used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references 
mentioned herein are incorporated by reference herein in their entirety. In case of conflict, 
the present specification, including definitions, will control. In addition, the materials, 
methods, and examples are illustrative only and not intended to be limiting. 

25 One advantage of the methods described herein is that the disease is identified prior to 

detection of overt clinical symptoms of breast cancer. Other features and advantages of the 
invention will be apparent from the following detailed description, and from the claims. 

Brief Description of the Figures 

Figure 1 depicts the results of semi-quantitative RT-PCR Expression levels of (a) A5657^ 
30 (b) B9769 and (c) C7965 in tumor cells from breast cancer patients (3T, 3 IT, 149T, 175T, 
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43 IT, 453T, 491T, 554T, 571T, 709T, 772T and 781T), breast cancer cell lines (EfflC4, 
HBC5, HBLIOO, HCC1937, MCF7, MDA-MB-231, SKBR3, T47D, YMBl, BT-20, BT.474, 
BT-549, HCC1143, HCC1500, HCC1599, MDA-MB-157, MDA-MB-435S, MDA-MB-453, 
OCUB-F and ZR-75-1), and normal human tissues are shown. Pre refers to normal breast 
5 ductal cells, MG to mammary gland, LUN to lung, LIV to liver, HEA to heart, KID to kidney 
and BM to bone marrow. 

Figure 2 depicts the results of Northem blot analysis of (a) A5657, (b) B9769 and (c) C7965 
transcripts in various human tissues (upper panel), and breast cancer cell lines and normal 
10 human vital organs (bottom panel). 

Figure 3 depcits the subcellular localization of exogenous (a) A5657, (b) B9769 (upper panel; 
low density of cells, bottom panel, high density of cells) and (d) C7965 and control Mock in 
T47D breast cancer cells. Part (c) depicts the subcellular localization of exogenous B9769 as 
15 compared to other cytoskeleton proteins. 

Figure 4 depicts the growth-inhibitory effects of small-interfering RNAs (siRNAs) designed 
to reduce expression of A5657 in breast cancer cells. Part (a) depicts the results of semi- 
quantitative RT-PCR demonstrating suppression of endogenous expression of A5657 in breast 
20 cancer cell (T47D) at 28-day cultures in selective medium containing neomycin after 

introduction .of siRNAs into T47D cells. GAPDH was used as an internal control. Part(b) 
depicts the results of an MTT assay demonstrating a decrease in the numbers of colonies by 
knockdown of A5657 in T47D cells. Part (c) depicts the results of a colony-formation assay 
demonstrating a decrease in the niraibers of colonies by knockdown of A5657 in T47D cells. 

25 

Figure 5 depicts the growth-inhibitory effects pf small-interfering RNAs (siRNAs) designed 
to reduce expression of B9769 in breast cancer cells. Part (a) depicts the results of semi- 
quantitative RT-PCR showing suppression of endogenous expression of B9769 in breast 
cancer cell (T47D) at 28-day cultures in selective medium containing neomycin after 
30 introduction of siRNAs into T47D cells. GAPDH was used as an internal control. Part(b) 
depicts the results of an MTT assay demonstrating a decrease in the numbers of colonies by 
knockdown of B9769 in T47D cells. Part (c) depicts the results of a colony-formation assay 
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demonstrating a decrease in the numbers of colonies by knockdown of B9769 in T47D cells. 

Figure 6 depicts the growth-inhibitory effects of small-interfering RNAs (siRNAs) designed 
to reduce expression of C7 965 in breast cancer cells. Part (a) depicts the results of semi- 

5 quantitative RT-PCR showing suppression of endogenous expression of C7965 in breast 
cancer cell (T47D) at 28-day cultures in selective medium containing neomycin after 
introduction of siRNAs into T47D cells. GAPDH was used as an internal control. Part(b) 
depicts the results of an MTT assay demonstrating a decrease in the numbers of colonies by 
knockdown of C7965 in T47D cells. Part (c) depicts the results of a colony-formation assay 

10 demonstrating a decrease in the numbers of colonies by knockdown of C7965 in T47D cells. 

Figure 7 demonstrates that the over-expression ofA5657, B9769 and C7965 increases the rate 
of cell proliferation. NIH3T3 cells were plated on 6-well plates and traosiently transfected 
withi45557, B9769 and C7965 exjaession vectors as indicated, respectively. MTT assays 
15 were performed to monitor the value of O.D. for cell proliferation at 1, 2, 4 and 6 days. Bars 
show mean ± S.E. (n = 2). 

Figure 8 depicts the ubiquitination of the A5657 protein. Part (A) depicts the results of an in 
vivo ubiquitination assay. Cell lysates were subjected directly to anti-Flag blotting (left) or 
20 immunoprecipitated with anti-HA antibody followed by anti-Flag immunoblotting (right). 
Part (B) demonstrates cell lysates subjected directiy to anti-HA blotting (left) or 
inmiunoprecipitated with anti-FLAG antibody followed by anti-HA immunoblotting (right). 
Ubiquitins were conjugated to FLAG-A5657. An extra band on the Western blot indicated 
A5657 with ubiquitination. 



wo 2005/029067 



PCT/JP2004/011741 



-12- 

Detailed Descrdption 

The words "a", "an" and "the" as used herem mean "at least one" unless otherwise 
specifically indicated. 

Generally breast cancer cells exist as a solid mass having a highly inflammatory 

5 reaction and containing various cellular components. Therefore, previous published 
microarray data are likely to reflect heterogenous profiles. 

With these issues in view, the present inventers prepared purified populations of breast 
cancer cells and normal breast epithelial duct cells by a method of laser-microbeam 
microdissection (LMM). The gene-expression profiles of cancer cells &om 81 BRCs, 

10 including 12 DCISs and 69 IDCs, were analyzed using a cDNA microarray representing 
23,040 genes. By comparing expression patterns between cancer cells firom patients 
diagnosed with BRC and normal ductal epithelial cells purely selected with Laser 
Microdissection, 102 genes (data not shown) were identified as commonly up-regulated in 
BRC cells, and 288 genes (data not shown) were identified as being commonly down- 

15 regulated in BRC cells. Candidate molecular markers having the potential to detect cancer- 
related proteins in serum of patients were selected, and some potential targets for 
development of signal-suppressing strategies in human BRC were discovered. In particular, 
the present invention involves the discovery of changes in expression patterns of three nucleic 
acids, namely ^5tf57, B9769 and C7965, between epithelial cells and carcmomas of patients 

20 with BRC. The A5657 gene (SEQ ID NO: 1) constitutes a novel sequence and encodes an 
HSPC150 protein (SEQ ID NO: 2) similar to ubiquitin-conjugating enzyme (Genbank 
Accession No. NM_014176). A5657 is up-regulated in IDC cells as compared to DCIS cells 
and normal breast epithelial cells. The B9769 gene (SEQ ID NO: 3) constitutes a novel 
sequence and encodes a hypothetical protein (SEQ ID NO: 4) (Genbank Accession No, 

25 NM_138770). B9769 is up-regulated in IDC cells as compared to normal breast epithelial 
cells. The C7965 gene (SEQ ID NO: 5), and its encoded protein (SEQ ID NO: 6) constitute 
known sequences designated as.LOC90557. C7965 is up-regulated in both DCIS cells and 
IDC cells as compared to normal breast epithelial cells. The differentially expressed genes 
identified herein find diagnostic utility as markers of BRC and as BRC gene targets, the 

30 expression of which may be altered to treat or alleviate a symptom of BRC. Alternatively, the 
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A5657 gene, differentially expressed between DCIS and IDC, identified herein finds 
diagnostic utility as a marker for distinguishing IDC fi'om DCIS as well as a BRC gene target, 
the expression of which may be altered to treat or alleviate a symptom of EDO. 

These genes, whose expression level is modulated (ie., increased) in BRC patients, 

5 are collectively referred to herein as "BRC-associated genes" , "BRC nucleic acids" or "BRC 
polynucleotides" and the correspondmg encoded polypeptides are referred to as "BRC 
polypeptides" or "BRC proteins." Unless indicated otherwise, "BRC" refers to any of the 
sequences disclosed herein (e.g., BRC-associated genes selected from the group consisting of 
A5657, B9769, and C7965). 

10 By measuring expression of the various genes in a sample of cells, BRC can be 

diagnosed. Sunilarly, measuring the expression of these genes in response to various agents 
can identify agents for treating BRC. 

The present invention involves determining (e.g., measuring) the expression of at least 
one, and up to aU the BRC-associated genes selected from the group consisting of A5 657, 

15 B9769, and C7965. Usmg sequence information provided by the GenBank™ database entries 
for known sequences, the BRC-associated genes can be detected and measured using 
techniques well known to one of ordmary skill in the art. For example, sequences within the 
sequence database entries corresponding to BRC-associated genes, can be used to construct 
probes for detecting RNA sequences corresponding to BRC-associated genes in, e.g., 

20 Northern blot hybridization analyses. Probes typically include at least 10, at least 20, at least 
50, at least 100, or at least 200 nucleotides of a reference sequence. As another example, the 
sequences can be used to construct primers for specifically amplifying the BRC nucleic acid 
in, e.g., amplification-based detection methods, such as reverse-transcription based 
polymerase chain reaction. 

25 Expression level of one or more of BRC-associated genes in a test cell population, e.g., 

a patient-derived tissues sample, is then compared to the expression level(s) of the same 
gene(s) in a reference population. The reference cell population includes one or more cells 
for which the compared parameter is known, /.e., breast ductal carcinoma cells (e.g., BRC 
cells) or normal breast ductal epithelial cells (e.g., non-BRC cells). 

30 Whether or not a pattem of gene expression in a test cell population as compared to a 

reference cell population indicates BRC or a predisposition thereto depends upon the 
composition of the reference cell population. For example, if the reference cell population is 
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composed of non-BRC cells, a similarity in gene expression pattern between the test cell 
population and the reference cell population indicates the test cell population is non-BRC. 
Conversely, if the reference cell population is made up of BRC cells, a sunilarity in gene 
expression profile between the test cell population and the reference cell population indicates 
5 that the test cell population includes BRC cells. 

A level of expression of a BRC marker gene in a test cell population is considered 
"altered" if it varies firam the expression level of the corresponding BRC marker gene m a 
reference cell population by more than 1.0, more than 1.5, more than 2.0, more than 5.0, more 
than 10.0 or more fold. 

10 Differential gene expression between a test cell population and a reference cell 

population can be normalized to a control nucleic acid, e.g. a housekeeping gene. For 
example, a control nucleic acid is one which is known not to differ depending on the 
cancerous or non-cancerous state of the cell. The expression level of a control nucleic acid 
can be used to normalize signal levels m the in the test and reference popiilations. Exemplary 

15 control genes include, but are not limited to, e.g., p-actin, glyceraldehyde 3- phosphate 
dehydrogenase and ribosomal protein PL 

The test cell population can be compared to multiple reference cell populations. Each 
of the multiple reference populations may differ in the known parameter. Thus, a test cell 
population may be compared to a first reference cell population known to contain, e.g., BRC 

20 cells, as well as a second reference population known to contain, e.g., non-BRC cells (normal 
cells). The test cell may be included in a tissue type or cell sample firom a subject known to 
contain, or suspected of containing, BRC cells. 

The test cell is obtained firom a bodily tissue or a bodily fluid, e.g., biological fluid 
(such as blood or sputum, for example). For example, the test cell may be purified firom 

25 breast tissue. Preferably, the test cell population comprises an epithelial cell. The epithelial 
cell is preferably from a tissue known to be or suspected to be a breast ductal carcinoma. 

Cells in the reference cell population should be derived from a tissue type similar to 
that of the test cell. Optionally, the reference cell population is a cell line, e.g. a BRC cell line 
(i.e., a positive control) or a normal non-BRC cell line (i.e., a negative control). Altematively, 

30 the control cell population may be derived from a database of molecular information derived 
from cells for which the assayed parameter or condition is known. 
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The subject is preferably a mammal. Exemplary mammals include, but are not limited 
to, e.g. , a human, non-human primate, mouse, rat, dog, cat, horse, or cow. 

Expression of the genes disclosed herein can be determined at the protein or nucleic 
acid level, using methods known m the art For example, Norfhem hybridization analysis, 
5 using probes which specifically recognize one or more of these nucleic acid sequences can be 
used to determine gene expression. Alternatively, gene expression may be measured using 
reverse-transcription-based PGR assays, e.g,, using primers specific for the differentially 
expressed gene sequences. Expression may also be determined at the protein level, /.e, by 
measuring the level of a polypeptides encoded by a gene described herein, or the biological 
10 activity thereof Such methods are well known in the art and include, but are not limited to, 
e.g., immunoassays that utilize antibodies to proteins encoded by the genes. The biological 
activities of the proteins encoded by the genes are either generally well known or can be 
routinely identified. 

Novel nucleotides, polypeptides, vectors and host cells: 

15 The present invention provides novel human genes, A5657 and B9769^ whose 

expression is markedly elevated in a great majority of breast carcinomas as compared to 
corresponding non-cancerous breast epithelia. The isolated A5657 gene includes a 
polynucleotide sequence as described in SEQ ID NO: 1, a cDNA sequence containing 928 
nucleotides. In particular, A5657 consists of 7 exons and is located on the chromosome 

20 lq32. 1 spanning approximately 1 0.3 kb in the genome. The cDNA transcript eventually 
encodes a polypeptide of 197 amino acids, an HSPC150 protein sknilar to ubiquitin- 
conjugating enzyme. The isolated B9769 gene includes a polynucleotide as described in SEQ 
ID NO: 3, a cDNA sequence containing 1472 nucleotides. In particular, B9769 consists of 8 
exons and is located on the chromosome 2q21 .2 spanning approximately 5.7 kb in the genome. 

25 The ORF starts at exon 1 and ends at exon 8. Eventually, the cDNA transaipt encodes a 
polypeptide of 378 amino acids. 

In addition to the novel human genes A5657 and B9769, including the polynucleotide 
sequences described in SEQ ID NOs: 1 and 3, the present invention encompasses degenerates 
and mutants thereof, to the extent that they encode an A5657 or B9769 protein, including the 

30 amino acid sequences set forth in SEQ ID NOs: 2 and 4 or functional equivalents thereof. 
Examples of polypeptides functionally equivalent to A5657 or B9769 include, for example. 
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homologous proteins of other organisms corresponding to the human A5657 or B9769 
proteins, as well as mutants of such human proteins. 

The present invention further encompasses polypeptides that are functionally 
equivalent to the polypeptides identified by the present inventors and polynucleotides 

5 encoding such functionally equivalent polypeptides. In the context of the present invention, 
the term "functionally equivalent" means that the subject polypeptide retains a biologically 
significant activity that is characteristic of tihie A5657 or B9769 protein, the amino acid 
sequences of which are shown in SEQ ID NOs: 2 and 4, respectively. For example, both the 
A5657 and B9769 genes are characterized as being specifically overexpressed in BRC cells as 

10 compared to normal cells. The A5657 gene is further overexpressed in IDC cells as compared 
to DCIS cells. Moreover, their overexpression is demonstrated herein to promote cell 
proliferation. Accordingly, in the context of the present invention, a functional equivant of 
the A5657 or B9769 protein should have a cell proliferative activity analogous to that of the 
wild-type protein. Cell proliferation is a parameter that may be measured using assays and 

15 technigues conventional in the art (e.g., the MTT assay discussed in the Examples below). In 
addition, the A5657 protein is demonstrated herein to have E2 ubiquitui enzyme activity and 
to bind to ubiquitin. Accordingly, in the context of the instant invention, a functional 
equivant of the A5657 protein should have ubiquitin enzyme and/or binding activities 
analogous to that of the wild-type protein. 

20 Accordingly, the present invention contemplates certain mutations or variants of the 

disclosed sequences. For example, the present invention encompasses polynucleotides 
encoding a protein of SEQ ID NO: 2 or 4, in which one or more amino acids are substituted, 
deleted, inserted and/or added, so long as the resulting protein retains a biologically 
significant activity of the wild-type protein. In a preferred embodiment, the functionally 

25 eqixivalent protein encoded by the polynucleotide of the present invention is similarly 

overexpressed in BRC cells as compared to normal cells. The present invention also includes 
polynucleotides that hybridize under stringent conditions with an A5657 or B9769 DNA, 
including the nucleotide sequence of SEQ ID NOs: 1 and 3, respectively, so long as the 
resulting polynucleotide encodes a protein that is functionally equivalent to the A5657 or 

30 B9769 protein. The determination of biologically significant activity can be conducted by 
methods well known to those skilled in the art, including methods described herein (see 
Examples section). 
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The invention provides an isolated polynucleotide that encodes a polypeptide 
described herein or a fragment thereof. Preferably, the isolated polypeptide is encoded by a 
nucleotide sequence that is at least about 60% identical to the nucleotide sequence shown in 
SEQ ID NO: 1 or 3. More preferably, the isolated nucleic acid molecule is at least about 65%, 

5 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more, 
identical to the nucleotide sequence shown in SEQ ID NO: 1 or 3. 

To determine the percent identity of two nucleic acids, the sequences are aligned for 
optimal comparison purposes. The percent identity between the two sequences is a function 
of the number of identical positions shared by the sequences (i.e., percent identity = number 

10 of identical positions/total number of positions (e.g., overlapping positions) multiplied by 
100). The percent identity between two sequences can be determined using conventional 
techniques such as to those described herein, with or without allowing gaps. In calculating 
percent identity, typically exact matches are counted. 

The determination of percent identity between two sequences can be accomplished 

15 using any conventional mathematical algorithm, such as the BLAST algoritimi by Karlin and 
Altschul (S. Karlin and S.F. Altschul, Proc. Nati, Acad. Sci. USA. 1990, 87: 2264-2268; S. 
Karlin and S.F. Altschul, Pr^c. Nati. Acad. Sci. USA. 1993, 90: 5873-5877). The BLAST 
algorithm is incorporated into the BLASTN and BLASTX programs of Altschul et al. (S.F. 
Altschul et al., J. MoL Biol. 1990, 215: 403). When a nucleotide sequence is analyzed 

20 according to BLASTN, suitable parameters include, for example, a score= 100 and word 
length= 12. On the other hand, suitable parameters for the analysis of amino acid sequences 
by BLASTX include, for example, a score= 50 and word length= 3. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul 
et al. (1997) Nucleic Acids Res. 25:3389. Altematively, PSI-Blast can be used to perform an 

25 iterated search that detects distant relationships between molecules. When utilizing BLAST, 
Gapped BLAST, and PSI-Blast programs, the default parameters of the respective programs 
(e.g., XBLAST and NBLAST) are preferably used. However, one skilled in the art can 
readily adjust the parameters to suit a particular purpose. 

Another example of a mathematical algorithm that may be utilized for the 

30 comparison of sequences is the algorithm of Myers and Miller (1988) CABIOS 4:1 1-17. Such 
an algorithm is incorporated into the ALIGN program (version 2.0), which is part of the GCG 
sequence alignment software package. When utilizing the ALIGN program for comparing 
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amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 can be used. 

Methods for preparing polypeptides fiinctionally equivalent to a given protein are well 
known to those skilled in the art and include convraitional methods of introducing mutations 

5 into a protein. For example, one skilled in the art can prepare polypeptides functionally 
equivalent to the A5657 or B9769 protein by introducing an appropriate mutation in the 
amino acid sequence of either of these proteins by site-directed mutagenesis (Hashimoto- 
Gotoh et al.. Gene 152:271-5 (1995); ZoUer and Smith, Methods Enzymol 100: 468-500 
(1983); Kramer et al., Nucleic Adds Res. 12:9441-9456 (1984); Kramer and Fritz, Methods 

10 Enzymol 154: 350-67 (1 987); Kunkel, Proc Natl Acad Sci USA 82: 488-92 (1985); Kunkel, 
Methods Enzymol 85: 2763-6 (1988)). Mutated or modified proteins, protems having amino 
acid sequences modified by substituting, deleting, inserting, and/or adding one or more amino 
acid residues of a certain amino acid sequence, have been known to retain the original 
biological activity (Mark et al., Proc Natl Acad Sci USA 81: 5662-6 (1984); Zoller and Smith, 

15 Nucleic Acids Res 10:6487-500 (1982); Dalbadie-McFarland et al., Proc Natl Acad Sci USA 
79: 6409-13 (1982)). Amino acid mutations can occur in nature, too. Accordingly, the 
present invention includes protems having the amino acid sequences of the A5657 or B9769 
protein in which one or more amino acids are mutated, provided the resulting mutated 
polypeptide is fimctionally equivalent to the wild-type A5657 or B9769 protein. 

20 The number of amino acids that may be mutated is not particularly restricted, so long 

as biologically significant activity is maintained. Generally, up to about 50 amino acids may 
be mutated, preferably up to about 30 amino acids, more preferably xxp to about 10 amino 
acids, and even more preferably up to about 3 to 5 amino acids. likewise, the site of 
mutation is not particularly restricted, so long as the mutation does not result in the disruption 

25 of biologically significant activity. 

Amino acid substitutions may be made at one or more predicted, preferably 
nonessential amino acid residues. A "nonessential" amino acid residue is a residue that can be 
altered fiom the wild-type sequence of a protein (e.g., the sequences shown in SEQ ID NOs: 2 
and 4) without altering the biological activity, vy^ereas an "essential" amino acid residue is 

30 required for biological activity. The amino acid residue to be mutated is preferably substituted 
for a different amino acid that allows the properties of the amino acid side-chain to be 
conserved (a process known as conservative amino acid substitution). Herein, the phrase 
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"conservative amino acid substitution" refers to the replacement of an amino acid residue is 
replaced with an amino acid residue having a chemically sinular side chain. Groups of amino 
acid residues having similar side chains have been defined m the art. Examples of amino acid 
groups include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side 

5 chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, 
asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta- 
branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., 
tyrosine, phenylalanine, tryptophan, histidine). Additional examples of amino acid groupings 

10 include side chains having the following chains properties, characteristics, and/or fimctional 
groups or characteristics in common: hydrophobic amino acids (A, I, L, M, F, P, W, Y, V), 
hydrophilic amino acids (R, D, N, C, E, Q, G, H, K, S, T), aliphatic side-chains (G, A, V, L, I, 
P); ahydroxyl containing side-chains (S, T, Y); sulfur atom containing side-chains (C, M); 
carboxylic acid and amide containing side-chains (D, N, E, Q); base containing side-chains (R, 

15 K, H); and aromatic containing side-chains (H, F, Y, W). Note, the parenthetic letters 

indicate the one-letter codes of amino acids. The niunber of amino acids that may be 

mutated is not particularly restricted, so long as the activity is maintained. Generally, up to 
about 50 amino acids may be mutated, preferably up to about 30 amino acids, more preferably 
up to about 10 amino acids, and even more preferably up to about 3 ammo acids. Likewise, 

20 the site of mutation is not particularly restricted, so long as the mutation does not result in the 
disnq)tion of the activity. Alternatively, the number of mutations typically corresponds to 
30% or less, or 20% or less, or 10% or less, preferably 5% or less, or 3% or less of the total 
amino acids, more preferably 2% or less or 1% or less of the total amino acids. 

An example of a polypeptide in which one or more amino acids residues are added to 

25 the amino acid sequence of an A5657 or B9769 protein is a fusion protein containing the 
A5657 or B9769 protein. Fusion proteins are fiisions of an A5657 or B9769 protein with 
other peptides or proteins, and are included in the present invention. Fusion proteins can be 
made by techniques well known to a person skilled in the art, such as by linking the DNA 
encoding an A5657 or B9769 protein of the present invention with DNA encoding other 

30 peptides or protems, so that the frames match, inserting the fusion DNA into an expression 
vector and expressing it in a host. There is no restriction as to the peptides or proteins fused 
to the protein of the present invention. 
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Known peptides that can be used as peptides that are fused to the protein of the present 
invention include, for example, FLAG (Hopp et al., Biotechnology 6: 1204-10 (1988)), 6xHis 
containing six His (histidine) residues, lOxIfis, Influenza agglutinin (HA), human c-myc 
fragment, VSP-GP fragment, plSHIV fragment, T7-tag, HSV-tag, E-tag, SV40T antigen 

5 fragment, Ick tag, a-tubulin fragment, B-tag, Protein C fragment, and the like. Examples of 
proteins that may be fiised to a protein of the invention include GST (glutathione-S- 
transferase). Influenza agglutinin (HA), unmimoglobulin constant region, p-galactosidase, 
MBP (maltose-binding protein), and such. 

Fusion proteins can be prepared by ftising commercially available DNA, encoding the 

10 ftision peptides or proteins discussed above, with the DNA encoding the polypeptide of the 
present invention and expressing the fiised DNA prepared. 

An altemative method known in the art for isolating fimctionally equivalent 
polypeptides mvolves conventional hybridization techniques (Sambrook et al.. Molecular 
Cloning 2nd ed. 9.47-9.58, Cold Spring Harbor Lab. Press (1989)). One skilled in the art can 

15 readily isolate a DNA having high homology with a whole or part of the DNA sequence 
encoding the human A5657 or B9769 protein (i.e., SEQ ID NO: 1 or 3), and isolate 
polypeptides fimctionally equivalent to the human A5657 or B9769 protein from the isolated 
DNA. The polypeptides of the present invention include those that are encoded by DNA fliat 
hybridize with a whole or part of the DNA sequence encoding the A5657 or B9769 protein 

20 and are fimctionally equivalent to the A5657 or B9769 protein. These polypeptides include 
mammal homologues corresponding to the protein derived from human (for example, a 
polypeptide encoded by a monkey, rat, rabbit and bovine gene). 

The condition of hybridization for isolating a DNA encoding a polypeptide 
fimctionally equivalent to the A5657 or B9769 protein can be routinely selected by a person 

25 skilled in the art For example, hybridization may be performed by conducting 

prehybridization at 68°C for 30 min or longer using "Rapid-hyb buffer" (Amersham LIFE 
SCIENCE), adding a labeled probe, and warming at 68°C for 1 hour or longer. The following 
washing step can be conducted, for example, m a low stringent condition. A low stringent 
condition is, for example, 42"C, 2X SSC, 0.1% SDS, or preferably 50*C, 2X SSC, 0.1% SDS. 

30 More preferably, high stringent conditions are used. A high stringent condition is, for 
example, washing 3 times in 2X SSC, 0.01% SDS at room temperature for 20 mm, then 
washing 3 times in Ix SSC, 0.1% SDS at 37°C for 20 min, and washing twice in Ix SSC, 
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0.1% SDS at 50°C for 20 min. However, several factors, such as temperature and salt 
concentration, can influence the stringency of hybridization and one skilled in the art can 
suitably select the factors to achieve the requisite stringency. 

In place of hybridization, a gene amplification method, for example, the polymerase 
• 5 chain reaction (PGR) method, can be utilized to isolate a DNA encoding a polypeptide 

functionally equivalent to the A5657 or B9769 protein, using a primer synthesized based on 
the sequence information of the protein encoding DNA (SEQ ID NO: 1 or 3). 

Polypeptides that are functionally equivalent to the A5657 or B9769 protein encoded 
by the DNA isolated through the above hybridization techniques or gene amplification 

10 techniques, normally have a high homology to the amino acid sequence of the hxmian A5657 
or B9769 protein. Herein, "high homology" typically refers to a homology of 40% or higher, 
preferably 60% or higher, more preferably 80% or higher, even more preferably 85%, 90% or 
95% or higher. The homology of a polypeptide can be determined by following the algorithm 
in "Wilbur and Lipman, Proc Natl Acad Sci USA 80: 726-30 (1983)". 

15 A polypeptide of the present invention may have variations in amino acid sequence, 

molecular weight, isoelectric point, the presence or absence of sugar chains, or form, 
depending on the cell or host used to produce it or the purification method utilized. 
Nevertheless, so long as it has a function equivalent to that of an A5657 or B9769 protein of 
the present invention, it is within the scope of the present invention. 

20 The polypeptides of the present invention can be prepared as recombinant proteins or 

natural proteins, by methods well known to those skilled in the art. A recombmant protein 
can be prepared by inserting a DNA, vMch encodes the polypeptide of the present invention 
(for example, the DNA comprising the nucleotide sequence of SEQ ID NO: 1 or 3), into an 
appropriate expression vector, introducing the vector into an appropriate host cell, obtaining 

25 the extract, and purifying the polypeptide by subjecting the extract to chromatography, for 
example, ion exchange chromatography, reverse phase chromatography, gel filtration, or 
aflBnity chromatography utilizing a column to which antibodies against the protein of the 
present invention is fixed, or by combining more than one of aforementioned columns. 
Also when the polypeptide of the present invention is expressed within host cells (for example, 

30 animal cells and E. coli) as a fiision protein with glutathione-S-transferase protein or as a 
recombinant protein supplemented with multiple histidines, the expressed recombinant 
protein can be purified using a glutathione colunm or nickel column. Altematively, when the 
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polypeptide of the present invention is expressed as a protein tagged with c-myc, multiple 
histidines, or FLAG, it can be detected and purified using antibodies to c-myc, His, or FLAG, 
respectively. 

After purifying the fusion protein, it is also possible to exclude regions other than the 

5 objective polypeptide by cutting with thrombin or factor-Xa as required. A natural protein 
. can be isolated by methods known to a person skilled in the art, for example, by contacting 
the afBnity column, in which antibodies binding to the A5657 or B9769 protein described 
below are boimd, with the extract of tissues or cells expressing the polypeptide of the present 
invention. The antibodies can be polyclonal antibodies or monoclonal antibodies. 

10 The present invention also encompasses partial peptides of polypeptides of the present 

invention. The partial peptide has an amino acid sequence specific to a polypeptide of the 
present invention and consists of at least 7 amino acids, preferably 8 amino acids or more, and 
more preferably 9 amino acids or more. The partial peptide can be used, for example, for 
preparing antibodies against the polypeptide of the present invention, screening for a 

15 compound that binds to the polypeptide of the present invention, and screening for 
accelerators or inhibitors of the polypeptide of the present invention. 

A partial peptide of the invention can be produced by genetic engineering, by known 
methods of peptide synthesis, or by digesting the polypeptide of the invention with an 
appropriate peptidase. For peptide synthesis, for example, solid phase synthesis or liquid 

20 phase synthesis may be used. 

Furthermore, the present invention provides polynucleotides encoding the polypeptide 
of the present invention. Examples of polynucleotides of the present invention include DNA 
comprising the nucleotide sequence of SEQ ED NOs: 1 or 3. The polynucleotides of tiie 
present invention can be used for the in vivo or in vitro production of the polypeptide of tiie 

25 present invention as described above, or can be applied to gene therapy for diseases attributed 
to genetic abnormality in the gene encoding the protein of the present invention. Any form of 
the polynucleotide of the present invention can be used so long as it encodes the polypeptide 
of the present invention, including mRNA, RNA, cDNA, genomic DNA, chemically 
synthesized polynucleotides. The polynucleotide of the present invention include a DNA 

30 comprising a given nucleotide sequences as well as its degenerate sequences, so long as the 
resulting DNA encodes a polypeptide of the present invention. 

A polynucleotide of the present invention is preferably isolated. As used herein, an 



wo 2005/029067 



PCT/JP2004/011741 



-23- 

"isolated polynucleotide" is a polynucleotide removed from its original environment (e.g., the 
natural environment if naturally occurring) and thus, altered by the "hand of man" from its 
natural state, the structure of which is not identical to that of any naturally occurring nucleic 
acid or to that of any fragment of a naturally occurring genomic nucleic acid sp annin g more 

5 than three genes. The term therefore covers, for example, (a) a DNA which fragment has the 
sequence of part of a naturally occurring genomic DNA molecule free of the codmg 
sequences that naturally flank the nucleic acid (i-e,, sequences located.at the 5* and 3' ends of 
the nucleic acid) in the genonoic DNA but is not flanked by both of the coding sequences that 
flank that part of the molecule in the genome of the organism in which it naturally occurs; (b) 

10 a nucleic acid incorporated into a vector or iato the genomic DNA of a prokaryote or 
eukaryote in a manner such that the resulting molecule is not identical to any naturally 
occurring vector or genomic DNA; (c) a separate molecule such as a cDNA, a genomic 
fragment, a fragment produced by polymerase chain reaction (PGR), or a restriction fragment; 
and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding 

15 a ftision protein. Specifically excluded from this definition are nucleic acids present in 
random, uncharacterized mixtures of different DNA molecules, transfected cells, or cell 
clones, e.g., as these occur in a DNA library such as a cDNA or genomic DNA library. 

Polynucleotides of the present mvention can be isolated by methods well known to 
those skilled in the art. Exemplary methods include the hybridization techniques discussed 

20 above (E.M. Southern, J. MoL Biol. 1975, 98: 503-517) as well as the polymerase chain 
reaction (PGR) technique (R.K. Saiki et al.. Science 1985, 230: 1350-1354; ILK. Saiki et al., 
Science 1988, 239: 487-491). More specifically, those skilled in the art can generally isolate 
polynucleotides highly homologous to the polynucleotide shown in SEQ ID NOs: 1 and 3 
from other animals (such as human), using the polynucleotide shown in SEQ ID NOs: 1 and 3 

25 or a part thereof as probes or using the oligonucleotide which specifically hybridizes with the 
polynucleotide shown in SEQ ID NOs: 1 and 3 as primers. Furthermore, polynucleotides that 
can be isolated by hybridization techniques or PGR techniques and that hybridize with 
polynucleotides shown in SEQ ID NOs: 1 and 3 are also included in the polynucleotides of 
the present invention. 

30 Hybridization reactions to isolate polynucleotides as described above are preferably 

conducted under stringent conditions. Hybridization may be perforaied with buffers that 
peraiit the formation of a hybridization complex between nucleic acid sequences that contain 
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some mismatches. At high stringency, hybridization complexes will remain stable only where 
the nucleic acid molecules are almost completely complementary. Many factors determine 
the stringency of hybridization, including G+C content of the cDNA, salt concentration, and 
temperature. For example, stringency may be increased by reducing the concentration of salt 

5 or by raising the hybridization temperature. Temperature conditions for hybridization and 
washing greatly influence stringency and can be adjusted using melting temperature (Tm). 
Tm varies with the ratio of constitutive nucleotides in the hybridizing base pairs, and with the 
composition of the hybridization solution (concentrations of salts, formamide and sodiirai 
dodecyl sulfate). In solutions used for some membrane based hybridizations, addition of an 

10 organic solvent, such as formamide, allows the reaction to occur at a lower temperature. 
Accordingly, on considering the relevant parameters, one skilled in the art can select 
appropriate conditions to achieve a suitable stringency based experience or experimentation. 

Examples of stringent hybridization conditions includes conditions comprising: 6 M 
urea, 0.4% SDS and 0.5x SSC, and those having a stringency equivalent to the conditions. 

15 Polynucleotides with higher homology are expected to be isolated when hybridization is 
performed under conditions with higher stringency, for example, 6 M urea, 0.4% SDS and 
O.lx SSC. Polynucleotides isolated under higher stringency conditions, such as described 
above, are expected to encode a polypeptide having a higher homology at the amino acid level 
to the amino acid sequence shown in SEQ ID NOs: 2 and 4. As noted above, "high 

20 homology" refers to an identity of at least 40% or more, preferably 60% or more, more 

preferably 80% or more, and even more preferably 85%, 90% or 95% or more, in the whole 
amino acid sequence. 

The present invention also includes allelic variants of the polynucleotides shown in 
SEQ ID NOs: 1 and 3. An allele is one of two or more alternate forms of a gene occupying 

25 the same locus in a particular chromosome or linkage structure and differing from other 

alleles of the locus at one or more mutational sites. Accordingly, an allelic variant comprises 
an alteration in the normal sequence of a gene. Complete gene sequencing often identifies 
numerous allelic variants (sometimes hundreds) for a given gene. Allelic variants have a high 
percent identity to the original polynucleotide and may dijBfer, for example, by about three 

30 bases per hundred bases. Allelic variants generally encode substantially identical proteins and 
typically do not affect the resulting phenotype. Allelic variants of a particular polynucleotide 
can be routinely obtamed, for example, via the hybridization techniques discussed above. 
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Examples of allelic variants include genetic polymorphisms. Polymorphisms 
comprise differences in DNA sequences that exist among individuals. Genetic variations 
occurring in more than 1% of a population are considered useful polymorphisms for genetic 
linkage analysis. Certain polymorphisms (e.g., restriction fragment length polymorphisms 

5 (RFLP)) result in variations among individuals in DNA fragment sizes cut by specific 

©Qzymes, such as restriction enzymes. Polymorphic sequences that result in RFLPs are useful 
as markers on both physical maps and genetic linkage maps. Another example of a common 
genetic polymorphism is a "single nucleotide polymorphism" (SNP). SNPs comprise changes 
in a single base as a result of a substitution, insertion or deletion. The change may be 

10 conservative (purine for purine) or non-conservative (purine to pyrimidine) and may or may 
not result in a change in an encoded amino acid. Such changes may predispose an individual 
to a specific disease or condition. 

The polynucleotide of the present invention can beprepared by methods known to a 
person skilled in the art. For example, the polynucleotide of the present invention can be 

15 prepared by: preparing a cDNA library from cells which express the polypeptide of the 
present invention, and conducting hybridization using a partial sequence of a DNA of the 
present invention (for example, SEQ ID NOs: 1 or 3) as a probe. A cDNA library can be 
prepared, for example, by the method described in Sambrook et al., Molecular Cloning, Cold 
Spring Harbor Laboratory Press (1989); alternatively, commercially available cDNA libraries 

20 may be used. A cDNA library can be also prepared by: extracting RNAs from cells 

ejcpressing the polypeptide of the present invention, synthesizing oligo DNAs based on the 
sequence of a DNA of tiie present invention (for example, SEQ ID NOs: 1 or 3), conducting 
PGR using the oligo DNAs as primers, and amplifying cDNAs encoding the protein of the 
present invention. 

25 In addition, by sequencing the nucleotides of the obtained cDNA, the translation 

region encoded by the cDNA can be routinely determmed, and the amino acid sequence of the 
polypeptide of tiie present invention can be easily obtained. Moreover, by screening tiie 
genomic DNA library using the obtained cDNA or parts thereof as a probe, the genomic DNA 
can be isolated. 

30 The obtained mKNA can be used to synthesize cDNA using reverse transraiptase. 

cDNA may be synthesized using a commercially available kit, such as the AMV Reverse 
Transcriptase First-sti-and cDNA Synthesis Kit (Seikagaku Kogyo). Alternatively, cDNA 
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may be syntiiesized and amplified following the 5'-RACE method (Frohman et al., Proc Natl 
Acad Sci USA 85: 8998-9002 (1988); Belyavsky et al., Nucleic Acids Res 17: 2919-32 
(1989)), vMch uses a primer and such, described herein, the 5'-Ampli FINDER RACE Kit 
(Clontech), and polymerase chain reaction (PGR). 
5 A desired DNA fragment can be prepared from the PGR products and ligated with a 

vector DNA. The recombinant vectors are used to transform E, coli and such, and a desired 
recombinant vector is prepared from a selected colony. The nucleotide sequence of the 
desired DNA can be verified by conventional methods, such as dideoxynucleotide chain 
termination. 

10 The nucleotide sequence of a polynucleotide of the invention may be designed to be 

expressed more efficiently by taking into accoimt the frequency of codon usage in the host to 
be used for expression (Grantham et al., Nucleic Acids Res 9: 43-74 (1981)). The sequence 
of the polynucleotide of the present invention may be altered by a commercially available kit 
or a conventional method. For instance, the sequence may be altered by digestion with 

15 restriction enzymes, insertion of a synthetic oligonucleotide or an appropriate polynucleotide 
fragment, addition of a linker, or insertion of the initiation codon (ATG) and/or the stop 
codon (TAA, TGA, or TAG). 

The present invention also provides a vector into which a polynucleotide of the 
present invention is inserted. A vector of the present invention is usefiil to keep a 

20 polynucleotide, especially a DNA, of the present invention in host cell, to express the 
polypeptide of the present invention, or to administer the polynucleotide of the present 
invention for gene therapy. 

When E. coli is a host cell and the vector is amplified and produced in a large amount 
in E. coli (e.g., JM109, DH5a, HBlOl, or XLlBlue), tiie vector should have "ori" to be 

25 amplified in E, coli and a marker gene for selecting transformed E, coli (e.g., a drug- 
resistance gene selected by a drug such as ampicillin, tetracycline, kanamycin, 
chloramphenicol or the like). For example, M13-series vectors, pUC-series vectors, pBR322, 
pBluescript, pCR-Script, etc. can be used. In addition, pGEM-T, pDIRECT, and pT7 can also 
be used for subcloning and extracting cDNA as well as the vectors described above. When a 

30 vector is used to produce the protein of the present invention, an expression vector is 

especially useful. For example, an expression vector to be expressed in JE. coli should have 
the above characteristics to be amplified in £. coli. When E, coli^ such as JM109, DH5a, 
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HBlOl, or XLl Blue, are used as a host cell, the vector should have a promoter, for example, 
lacZ promoter (Ward et al., Nature 341: 544-6 (1989); FASEB J 6: 2422-7 (1992)), araB 
promoter (Better et al.. Science 240: 1041-3 (1988)), or T7 promoter or the like, that can 
eflBciently express the desired gene in E. coli. In that respect, pGEX-5X-l (Pharmacia), 

5 "QIAexpress system" (Qiagen), pEGFP and pET (in this case, the host is preferably BL21 
which expresses T7 RNA polymerase), for example, can be used instead of the above vectors. 
Additionally, the vector may also contain a signal sequence for polypeptide secretion. An 
exemplary signal sequence that directs the polypeptide to be secreted to the periplasm of the E, 
coli is the pelB signal sequence (Lei et al., J Bacteriol 169: 4379 (1987)). Means for 

10 introducing of the vectors into the target host cells include, for example, the calcium chloride 
method, and the electroporation method. 

In addition to E. coli, for example, expression vectors derived from mammals (for 
example, pcDNA3 (Invitrogen) and pEGF-BOS (Nucleic Acids Res 18(17): 5322 (1990)), 
pEF, pCDM8), expression vectors derived from msect cells (for example, "Bac-to-BAC 

15 baculovirus expression system" (GIBCO BRL), pBacPAK8), expression vectors derived from 
plants (e.g., pMHl, pMH2), expression vectors derived from animal viruses (e.g., pHSV, 
pMV, pAdexLcw), expression vectors derived from retroviruses (e.g., pZIpneo), expression 
vector derived from yeast (e.g., "Pichia Expression Kit" (Invitrogen), pNVl 1, SP-QOl), and 
expression vectors derived from Bacillus subtilis (e.g., pPL608, pKTHSO) can be used for 

20 producing the polypeptide of the present iavention. 

In order to express the vector in animal cells, such as CHO, COS, or NIH3T3 cells, the 
vector should have a promoter necessary for expression in such cells, for example, the S V40 
promoter (MuUigan et al.. Nature 277: 108 (1979)), the MMLV-LTR promoter, the EFla 
. promoter (Mizushuna et al.. Nucleic Acids Res 18: 5322 (1990)), the CMV promoter, and the 

25 like, and preferably a marker gene for selectmg transformants (for example, a drug resistance 
gene selected by a drug (e.g., neomycin, 0418)). Examples of known vectors with these 
characteristics include, for example, pMAM, pDR2, pBK-RSV, pBK-CMV, pOPRSV, and 
pOP13. 

In addition, methods may be used to express a gene stably and, at the same time, to 
30 amplify the copy number of the gene in cells. For example, a vector comprising the 

complementary DHFR gene (e.g., pCHO I) may be introduced into CHO cells in which the 
nucleic acid synthesizing pathway is deleted, and then amplified by methotrexate (MTX). 
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Furthermore, in case of transient expression of a gene, the method wherein a vector 
comprismg a replication origin of SV40 (pcD, etc.) is transformed into COS cells comprising 
the S V40 T antigen expressing gene on the chromosome can be used. 

A polypeptide of the present invention obtained as above may be isolated jfrom inside 

5 or outside (such as medium) of host cells, and purified as a substantially pure homogeneous 
polypeptide. The term "substantially pure" as used herem in reference to a given polypeptide 
means that the polypeptide is substantially free from contaminants, such as other biological 
macromolecules, culture media (if recombinantly produced), or chemical precursors (if 
chemically synthesized). The substantially pure polypeptide is at least about 75%, preferably 

10 at least about 80%, more preferably at least about 85, 90, 95, or 99% (e.g., at least 80, 85, 95, 
or 99%) pure by dry weight. Purity can be measured by any appropriate standard 
conventional method, for example by column chromatography, polyacrylamide gel 
electrophoresis, or HPLC analysis. 

For instance, column chromatography, filter, ultrafiltration, salt precipitation, solvent 

15 precipitation^ solvent extraction, distillation, immunoprecipitation, SDS-polyacrylamide gel 
electrophoresis, isoelectric point electrophoresis, dialysis, and recrystallization may be 
appropriately selected and conibined to isolate and purify the polypeptide. 

Examples of chromatography include, for example, affinity chromatography, ion- 
exchange chromatography, hydrophobic chromatography, gel filtration, reverse phase 

20 chromatography, adsorption chromatography, and such (Strategies for Protein Purification 
and Characterization: A Laboratory Course Manual. Ed Daniel R. Marshak et al.. Cold 
Spring Harbor Laboratory Press (1996)). These chromatographies may be performed by 
liquid chromatography, such as HPLC and FPLC. Thus, the present invention provides for 
highly purified polypeptides prepared by the above methods. 

25 A polypeptide of the present invention may be optionally modified or partially deleted 

by treating it with an appropriate protein modification enzyme before or after purification. 
Usefiil protein modification enzymes include, but are not limited to, trypsin, chymotrypsin, 
lysylendopeptidase, protein kinase, glucosidase, and so on. 



30 



Antibodies: 

The present invention ftirther provides antibodies that binds to a polypeptide of the 
invention. An antibody of the present invention can be used in any form, such as monoclonal 



wo 2005/029067 



PCT/JP2004/011741 



-29- 

or polyclonal antibodies, and includes antiserum obtained by immunizing an animal such as a 
rabbit with the polypeptide of the invention, all classes of polyclonal and monoclonal 
antibodies, human antibodies, and humanized antibodies produced by genetic recombination. 

A polypeptide of the invention used as an antigen to obtain an antibody may be 
5 derived jfrom any animal species, but preferably is derived from a mammal such as a human, 
mouse, or rat, more preferably from a human. A himian-derived polypeptide may be obtained 
from the nucleotide or amino acid sequences disclosed herein. 

According to the present invention, the polypeptide to be used as an immunization 
antigen may be a complete protein or a partial peptide of the protein. A partial peptide may 
10 comprise, for example, the ammo O^O-tenninal or carboxy (C)-terminal fragment of a 
polypeptide of the present invention. 

Any mammalian animal may be immunized with the antigen, but preferably the 
compatibility with parental cells used for cell fusion is taken into account. In general, animals 
of the orders Rodentia, Lagomorpha, and Primate are preferably used. Animals of the order 
15 Rodentia include, but are not limited to, for example, mouse, rat, and hamster. Animals of the 
order Lagomorpha mclude, but are not limited to, for example, rabbit Animals of the order 
Primate include, but are not Imiited to, for example, a monkey of the infra-order Catarrhini 
(old world monkey), such as Macaca fascicularis, rhesus monkey, sacred baboon, and 
chimpanzees. 

20 Methods for immunizing animals with antigens are known in the art. Intraperitoneal 

injection or subcutaneous mjection of antigens is a standard method for immunization of 
mammals. More specifically, antigens may be diluted and suspended in an appropriate 
amount of phosphate buffered salme (PBS), physiological saline, etc. If desired, the antigen 
suspension may be mixed with an appropriate amount of a standard adjuvant, such as 

25 Freund's complete adjuvant, made mto emulsion, and then administered to mammalian 
animals. Preferably, it is followed by several administrations of antigen mixed with an 
appropriately amount of Freund's incomplete adjuvant every 4 to 21 days. An appropriate 
carrier may also be used for immunization. After immunization as above, serum is examined 
by a standard method for an increase in the amount of desired antibodies. 

30 Polyclonal antibodies against the polypeptides of the present invention may be 

prepared by collecting blood from the immunized mammal examined for the increase of 
deshed antibodies in the serum, and by separating serum from the blood by any conventional 
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method. Polyclonal antibodies include serum containing the polyclonal antibodies, as well as 
the fraction containing the polyclonal antibodies may be isolated from the serum. 
Immunoglobulin G or M can be prepared from a fraction which recognizes only the 
polypeptide of the present invention using, for example, an affinity column coupled with the 
5 polypeptide of the present invention, and further purifying this fraction using protein A or 
protein G column. 

To prepare monoclonal antibodies, immune cells are collected from the mammal 
immunized with the antigen and checked for the increased level of desired antibodies in the 
serum as described above, and are subjected to cell fusion. The immune cells used for cell 
10 fusion are preferably obtained from spleen. Other preferred parental cells to be fused with the 
above immunocyte include, for example, myeloma cells of mammalians, and more preferably 
myeloma cells having an acquired property for the selection of fused cells by drugs. 

The above immunocyte and myeloma cells can be fused according to known methods, 
for example, the method of Milstein et al. (Galfre and Milstein, Methods Enzymol 73: 3-46 
15 (1981)). 

Resulting hybridomas obtained by the cell fusion may be selected by cultivating them 
in a standard selection medium, such as HAT medium (hypoxanthine, aminopterin, and 
thymidine containing medium). The cell culture is typically continued in the HAT medium 
for several days to several weeks, the time being sufficient to allow all the other cells, with 

20 the exception of the desired hybridoma (non-fused cells), to die. Then, the standard limiting 
dilution is performed to screen and clone a hybridoma cell producing the desired antibody. 

In addition to the above method, in which a non-human animal is immunized with an 
antigen for preparing hybridoma, human lymphocytes such as those infected by EB virus may 
be immunized with a polypeptide, polypeptide expressmg cells, or their lysates in vitro. Then^ 

25 the inmiunized lymphocytes are fused with human-derived myeloma cells that are capable of 
indefinitely dividing, such as U266, to yield a hybridoma producing a desired human antibody 
fliat is able to bmd to the polypeptide can be obtained (Unexamined Published Japanese 
Patent Application No. (JP-A) Sho 63-17688). 

The obtained hybridomas are subsequently transplanted into the abdominal cavity of a 

30 mouse and the ascites are extracted. The obtained monoclonal antibodies can be purified by, 
for example, ammonivun sulfate precipitation, a protein A or protein G column, DEAE ion 
exchange chromatography, or an affinity column to which the polypeptide of the present 
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invention is coupled. The antibody of the present invention can be used not only for 
purification and detection of the polypeptide of the present invention, but also as a candidate 
for agonists and antagonists of flie polypeptide of the present invention. In addition, this 
antibody can be applied to the antibody treatment for diseases related to the polypeptide of the 
5 present invention. When the obtamed antibody is to be administered to the human body 
(antibody treatment), a human antibody or a humanized antibody is preferable for reducing 
immunogenicity. 

For example, transgenic animals havuig a repertory of human antibody genes may be 
immunized with an antigen selected firom a polypeptide, polypeptide expressing cells, or their 
10 lysates. Antibody producing cells are then collected fi:om the animals and fused with 

myeloma cells to obtain hybridoma, from which human antibodies against the polypeptide 
can be prepared (see WO92-03918, W093.2227, WO94-02602, W094-25585, W096-33735, 
andWO96-34096). 

Alternatively, an immune cell, such as an immunized lymphocyte, producing 
15 antibodies may be immortalized by an oncogene and used for preparing monoclonal 
antibodies. 

Monoclonal antibodies thus obtained can be also recombinantly prepared using 
genetic engineering techniques (see, for example, Borrebaeck and Larrick, Therapeutic 
Monoclonal Antibodies, published m the United Kingdom by MacMillan Publishers LTD 

20 (1990)). For example, a DNA encoding an antibody may be cloned from an immune cell, 
such as a hybridoma or an immunized lymphocyte producing the antibody, inserted into an 
appropriate vector, and introduced into host cells to prepare a recombinant antibody. The 
present invention also provides recombinant antibodies prepared as described above. 

Furthermore, an antibody of the present invention may be a fragment of an antibody or 

25 modified antibody, so long as it binds to one or more of the polypeptides of the invention. 
For mstance, the antibody fragment may be Fab, F(ab')2, Fv, or single chain Fv (scFv), in 
which Fv fragments from H and L chains are ligated by an appropriate linker (Huston et al., 
Proc Natl Acad Sci USA 85: 5879-83 (1988)). More specifically, an antibody fragment may 
be generated by treating an antibody with an enzyme, such as papain or pepsin. Alternatively, 

30 a gene encoding the antibody fragment may be constructed, inserted into an expression vector, 
and expressed m an appropriate host cell (see, for example, Co et al., J Immunol 152: 2968-76 
(1994); Better and Horwitz, Methods Enzymol 178: 476-96 (1989); Pluckthun and Skerra, 
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Methods Enzymol 178: 497-515 (1989); Lamoyi, Methods Enzymol 121: 652-63 (1986); 
Rousseaux et al., Methods Enzymol 121 : 663-9 (1986); Bird and Walker, Trends Biotechnol 
9: 132-7 (1991)). 

An antibody may be modified by conjugation with a variety of molecules, such as 
5 polyethylene glycol (PEG), The present invention provides for such modified antibodies. 
The modified antibody can be obtained by chemically modifying an antibody. These 
modification methods are conventional in the field. 

Alternatively, an antibody of the present invention may be obtained as a chimeric 
antibody, between a variable region derived firom nonhuman antibody and the constant region 
10 derived fironi human antibody, or as a humanized antibody, comprising the complementarity 
determining region (CDR) derived from nonhuman antibody, the frame work region (FR) and 
the constant region derived firom human antibody. Such antibodies can be prepared by using 
known technology. 

Antibodies obtained as above may be purified to homogeneity. For example, the 

15 separation and purification of the antibody can be performed according to separation and 
purification methods used for general proteins. For example, the antibody may be separated 
and isolated by the appropriately selected and combined use of colunm chromatographies, 
such as affinity chromatography, filter, ultrafiltration, salting-out, dialysis, SDS 
polyacrylamide gel electrophoresis, isoelectric focusing, and others (Antibodies: A 

20 Laboratory Manual. Ed Harlow and David Lane, Cold Spring Harbor Laboratory (1 988)), but 
are not limited thereto. A protein A column and protein G colunm can be used as the affinity 
column. Exemplary protein A columns to be used include, for example. Hyper D, POROS, 
and Sepharose F.F. (Pharmacia). 

Exemplary chromatography, with the excq)tion of aflSnity includes, for example, ion- 

25 exchange chromatography, hydrophobic chromatography, gel filtration, reverse-phase 

chromatography, adsorption chromatography, and the like (Strategies for Protein Purification 
and Characterization: A Laboratory Course Manual. Ed Daniel R. Marshak et al.. Cold Spring 
Harbor Laboratory Press (1996)). The chromatographic procedures can be carried out by 
liquid-phase chromatography, such as HPLC, and FPLC. 

30 For example, measurement of absorbance, enzyme-linked immunosorbent assay 

(ELISA), enzyme immunoassay (EIA), radioimmunoassay (RIA), and/or 
immunofluorescence may be used to measure the antigen binding activity of the antibody of 
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the invention. In ELIS A, the antibody of the present invention is immobilized on a plate, a 
polypeptide of the uivention is applied to the plate, and then a sample containing a desired 
antibody, such as culture supernatant of antibody producing cells or purified antibodies, is 
applied. Then, a secondary antibody that recognizes the primary antibody and is labeled with 

5 an enzyme, such as alkaline phosphatase, is applied, and the plate is incubated. Next, after 
washing, an enzyme substrate, such as p-nitrophenyl phosphate, is added to the plate, and the 
absorbance is measured to evaluate the antigen binding activity of the sample. A fi-agment of 
the polypeptide, such as a C-terminal or N-terminal firagment, may be used as the antigen to 
evaluate the binding activity of Ihe antibody. BIAcore (Pharmacia) may be used to evaluate 

10 the activity of the antibody according to the present invention. 

Diagnosing breast cancer: 

In the context of the present invention, BRC is diagnosed by measuring the expression 
level of one or more BRC nucleic acids from a test population of cells, (/.e., a patient-derived 

15 biological sample). Preferably, the test cell population contains an epithelial cell, e.g., a cell 
obtained from breast tissue. Gene expression can also be measured from blood or other 
bodily fluids such as urine. Other biological samples can be used for measuring protein levels. 
For example, the protein level in blood or serum derived from a subject to be diagnosed can 
be measured by immunoassay or other conventional biological assay. 

20 Expression of one or more BRC-associated genes, A5657, B9769 and C7965, is 

detCTmined in the test cell or biological sample and compared to the normal control 
expression level associated with the one or more BRC-associated gene(s) assayed. A normal 
control level is an expression profile of a BRC-associated gene typically found in a population 
known not to be suffering from BRC. An alteration (e.g., an mcrease) m the level of 

25 expression in the patient-derived tissue sample of one or more BRC-associated gene indicates 
that the subject is suffering from or is at risk of developing BRC. For example, an increase in 
the expression of one or more of the BRC-associated genes A5657, £9769 and C7965 in the 
test population as compared to the normal control level indicates that the subject is suffering 
from or is at risk of developing BRC. 

30 Increase in the level of expression of one or more of the BRC-associated genes in the 

test population as compared to the normal control level indicates that the subject suffers fix)m 
or is at risk of developing BRC. For example, increase of one, two or three of the panel of 
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BRC-associated genes (e.g., A5657, B9769 and C7965) indicates that the subject suffers jfrom 
or is at risk of developing BRC. 

Identifvins asents that inhibit BRC-associated 2ene expression: 

An agent that inhibits the expression of a BRC-associated gene or the activity of its 

5 gene product can be identified by contacting a test cell population expressing a BRC- 
associated up-regulated gene with a test agent and then determining the expression level or 
activity of the BRC-associated gene. A decrease in the level of expression of the BRC- 
associated gene or in the level of activity of its gene product in the presence of the agent as 
compared to the normal control level (or compared to the expression or activity level in the 

10 absence of the test agent) indicates that the agent is an inhibitor of a BRC-associated gene and 
useful in inhibiting BRC. 

The test cell population may be any cell expressing the BRC-associated genes. For 
example, the test cell popxilatipn may contain an epithelial cell, such as a cell derived firom 
15 breast tissue. Furthermore, the test cell may be an immortalized cell line derived from an 
carcinoma cell. Alternatively, the test cell may be a cell which has been transfected with a 
BRC-associated gene or which has been transfected with a regulatory sequence (e.g. promoter 
sequence) from a BRC-associated gene operably linked to a reporter gene. 

20 Assessing efficacy of treatment of BRC in a subject: 

The differentially e:q)ressed BRC-associated genes identified herein also allow for the 
course of treatment of BRC to be monitored. In this method, a test cell population is provided 
from a subject undergoing treatment for BRC. If desired, test cell populations are obtained 
from the subject at various time points, before, during, and/or after treatment. Expression of 

25 one or more of the BRC-associated genes in the cell population is then determined and 

compared to a reference cell population which includes cells whose BRC state is known. In 
the context of the present invention, the reference cells should have not been exposed to the 
treatment of interest. 

If the reference cell population contains no BRC cells, a similarity in the expression of 
30 a BRC-associated gene in the test cell population and the reference cell population indicates 
that the treatment of interest is efficacious. However, a difference in the expression of a 
BRC-associated gene in the test population and a normal control reference cell population 
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indicates a less favorable clinical outcome or prognosis. Similarly, if the reference cell 
population contains BRC cells, a difference between the e}q)fession of a BRC-associated gene 
in the test cell population and the reference cell population indicates that the treatment of 
interest is eflScacious, while a similarity in the expression of a BRC-associated gene in the test 
S population and a control reference cell population indicates a less favorable clinical outcome 
or prognosis. 

Additionally, the expression level of one or more BRC-associated genes 
determined in a subject-derived biological sample obtained after treatment (i.e., post- 
treatment levels) can be compared to the expression level of the one or more BRC-associated 

10 genes determined in a subject-derived biological sample obtained prior to treatment onset (i.e., 
pre-treatment levels). A decrease in the expression level in a post-treatment sample indicates 
that the treatment of interest is eflBcacious while an increase or maintenance in the expression 
level in the post-treatment sample indicates a less favorable clinical outcome or prognosis. 

As used herein, the term "efGcacious"' indicates that the treatment leads to a reduction 

IS in the expression of a pathologically up-regulated gene, an increase in the e?q)ression of a 
pathologically down-regulated gene or a decrease in size, prevalence, or metastatic potential 
of breast ductal carcinoma in a subject. When a treatment of interest is applied 
prophylactically, the term "efficacious" means that the treatment retards or prevents a breast 
tumor from forming or retards, prevents, or alleviates a symptom of clinical BRC. 

20 Assessment of breast tumors can be made using standard clinical protocols. 

In addition, efficaciousness can be determined in association with any known method 
for diagnosing or treating BRC. BRC can be diagnosed, for example, by identifying 
symptomatic anomalies, e.g., weight loss, abdominal pain, back pain, anorexia, nausea, 
vomiting and generalized malaise, weakness, and jaundice. 

25 

Selecting a therapeutic a^entfor treating BRC that is appropriate for a particular individual: 

Differences in the genetic makeup of individuals can result in differences in their . 
relative abilities to metabolize various dmgs. An agent that is metabolized in a subject to act 
as an anti-BRC agent can manifest itself by inducing a change in a gene expression pattern in 
30 the subject's cells from that characteristic of a cancerous state to a gene expression pattern 
characteristic of a non-cancerous state. Accordingly, the differentially expressed BRC- 
associated genes disclosed herein allow for a putative therapeutic or prophylactic inhibitor of 
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BRC to be tested in a test cell population from a selected subj wt in order to determine if the 
agent is a suitable inhibitor of BRC in the subject 

To identify an inhibitor of BRC that is appropriate for a specific subject, a test cell 
population from the subject is exposed to a therapeutic agent, and the expression of one or 
5 more of the BRC-associated genes A5657, B9769 and C7965 is detemiined. 

In the context of the method of the present invention, the test cell population contains 
a BRC cell expressing a BRC-associated gene. Preferably, the test cell is an epithelial cell. 
For example, a test cell population may be incubated in the presence of a candidate agent and 
the pattern of gene expression of the test sample may be measured and compared to one or 
10 more reference profiles, e.g., a BRC reference expression profile or a non-BRC reference 
expression profile. 

A decrease in expression of one or more of the BRC-associated genes A5657, B9769 
and C7965 relative to a reference cell population containing BRC indicates that the agent has 
therapeutic potential. 

IS In the context of the present invention, the test agent can be any compound or 

composition. Exemplary test agents include, but are not limited to, immunomodulatoiy 
agents. 

Screening assays for identifying therapeutic agents: 

20 The differentially caressed BRC-associated genes disclosed herein can also be used 

to identify candidate therapeutic agents for treating BRC. The method of the present 
invention involves screening a candidate therapeutic agent to determine if it can convert an 
expression profile of one or more of the BRC-associated genes A5657, B9769 and C7965 
characteristic of a BRC state to a gene expression pattem characteristic of a non-BRC state. 

25 In the instant method, a cell is exposed to a test agent or a plurality of test agents 

(sequentially or in combination) and the expression of one or more of the BRC-associated 
genes A5657, B9769 and C7965 in the cell is measured. The expression profile of the BRC- 
associated gene(s) assayed in the test population is compared to e^qpression level of the same 
BRC-associated gene(s) in a reference cell population that is not exposed to the test agent 
J 30 An agent capable of suppressing the expression of over-expressed or up-regulated 

genes has potential clinical benefit. Such agents may be fiirther tested for the ability to 
prevent breast ductal carcinomal growth in animals or test subjects. 
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In a fiirther embodiment, the present invention provides methods for screening 
candidate agents which are potential targets in the treatment of BRC. As discussed in detail 
above, by controlling the expression levels of marker genes or the activities of their gene 
products, one can control the onset and progression of BRC. Thus, candidate agents, which 
5 are potential targets in the treatment of BRC, can be identified through screening methods that 
use such expression levels and activities of as indices of the cancerous or non-cancerous state. 
In the context of the present invention, such screening may comprise, for example, the 
following steps: 

a) contacting a test compoimd with a polypeptide encoded by A5657, B9769 and 
10 C7965\ 

b) detecting the binding activity between the polypeptide and the test compoxmd; and 

c) selecting the test compoimd that binds to the polypeptide. 

Altematively, the screening method of the present invention may comprise the 
following steps: 

15 a) contacting a candidate compoimd with a cell expressing one or more marker genes, 

wherein the one or more marker genes is selected firom the group consisting A5657, 
B9769mdC7965;and 
b) selecting the candidate compound that reduces the expression level of one or more 
marker genes as compared to a control. 
20 Cells expressing a marker gene include, for example, cell lines established from BRC; such 
cells can be used for the above screening of the present invention. 

Altematively, the screening method of the present invention may comprise the 
following steps: 

a) contacting a test compound with a polypeptide encoded by a polynucleotide selected 
25 from the group consisting of the BRC-associated genes A5657, B9769 and C7965; 

b) detecting the biological activity of the polypeptide of step (a); and 

c) selecting a compound that suppresses the biological activity of the polypeptide as 

compared to the biological activity detected in the absence of the test compound. 
A protein for use in the screening method of the present invention can be obtained as a 
30 recombinant protein using the nucleotide sequence of the marker gene. Based on the 
information regarding the marker gene and its encoded protein, one skilled m the art can 
select any biological activity of the protein as an index for screening and any suitable 
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measurement method to assay for the selected biological activity. 

In another embodiment of the method for screening a compound for treating a cell 
proliferative disease of the present invention, the method utilizes biological activity of the 
polypeptide of the present invention as an index. Since the A5657, B9769 and C7965 proteins 

5 of the present invention have the activity of promoting cell proliferation, a compoxmd which 
inhibits this activily of one of these proteins can be screened using this activity as an uidex. 
This screening method includes the steps of: (a) contacting a test compound with a 
polypeptide selected from group consisting of A5657, B9769 and C7965\ (b) detecting the 
biological activity of the polypeptide of step (a); and (c) selecting a compound that suppresses 

10 the biological activity of the polypeptide in comparison with the biological activity detected 
in the absence of the test compound. In the present method, the polypeptide is preferably 
e?q)ressed in a living cell, and the biological activity can be detected by cell proliferation as 
the index. Furtheraiore, the preferable living cell is a host cell transfeted by polynucleotide 
selected from group consisting of A5657, B9769 and C7965. For example, NIH3T3 can be 

IS used as the host cell. 

Any polypeptides can be used for screening so long as they comprise the biological 
activity of the A5657^ B9769 and C7965 protein. Such biological activity include cell- 
proliferating activity of the human A5657, B9769 and C7965 protein. For example, a human 
A5657, B9769 and C7965 protein can be used and polypeptides functionally equivalent to 

20 these proteins can also be used. Such polypeptides may be e?q)ressed endogenously or 
exogenously by cells. 

Alternatively, the screening method of the present invention may comprise the 
following steps: 

a) contacting a candidate compound with a cell into which a vector, comprising the 
25 transcriptional regulatory region of one or more marker genes and a reporter gene 

that is expressed under the control of the transcriptional regulatory region, has been 
introduced, wherein the one or more marker genes are selected from the group 
consisting of the BRC-associated genes A5657, B9769 and C7965 ; 

b) measuring the expression level or activity of said reporter gene; and 

30 c) selecting the candidate compound that reduces the expression level or activity of 
said reporter gene as compared to a control 
Suitable reporter genes and host cells are well known in tiie art A reporter construct 
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suitable for the screening method of the present invention can be prepared by using the 

transcriptional regulatory region of a marker gene. When the transcriptional regulatory region 

of the marker gene is known to those skilled in the art, a reporter construct can be prepared by 

using the previous sequence information. When the transcriptional regulatory region of the 
5 marker gene remains unidentified, a nucleotide segment containing the transcriptional 

regulatory region can be isolated from a genome library based on the nucleotide sequence 

information of the marker gene. 

Any test compoimds, for example, cell extracts, cell culture supernatant, products of 

fermenting microorganism, extracts of marine organism, plant extracts, purified or crade 
10 proteins, peptides, non-peptide compounds, synthetic micromolecular compounds, natural 

compounds, can be used. 

A compound isolated by the screening serves as a candidate for the development of 

drags that mhibit the activity of the protein encoded by marker gene and can be applied to the 

treatment or prevention of breast cancer. 
15 Moreover, compounds m which a part of the structure of the compound inhibiting the 

activity of proteins encoded by marker genes is converted by addition, deletion and/or 

replacement are also included as the compounds obtainable by this screening method of the 

present invention. 

When administrating a compound isolated by the method of the present invention as a 
20 pharmaceutical for humans and other mammals, such as mice, rats, gumea-pigs, rabbits, cats, 
dogs, sheep, pigs, cattie, monkeys, baboons, and chimpanzees, the isolated compoimd can be 
durectiy administered or can be formulated into a dosage form using known pharmaceutical 
preparation methods. For example, according to the need, tiie drags can be taken orally, as 
sugar-coated tablets, capsules, elixirs and microcapsules, or non-orally, in the form of 
25 injections of sterile solutions or suspensions with water or any other pharmaceutically 
acceptable liquid. For example, the compounds can be mixed with pharmaceutically 
acceptable carriers or media, specifically, sterilized water, physiological saline, plant-oils, 
emulsifiers, suspending agents, surfactants, stabilizers, flavoring agents, excipients, vehicles, 
preservatives, binders, and such, in a unit dose form required for generally accepted drag 
30 implementation. The amount of active ingredient contained in such a preparation makes a 
suitable dosage within the indicated range acquirable. 

Examples of additives that can be admixed into tablets and capsules include, but are 
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not limited to, binders, such as gelatin, com starch, tragacanth gum and arable gum; 
excipients, such as crystalline cellulose; swelling agents, such as com starch, gelatin and 
alginic acid; lubricants, such as magnesium stearate; sweeteners, such as sucrose, lactose or 
saccharin; and flavoring agents, such as pepperaiint, Gaultheria adenothrix oil and cherry. 

5 When the unit-dose form is a capsule, a liquid carrier, such as an oil, can be further included 
in the above ingredients. Sterile composites for injection can be formulated following normal 
dmg implementations using vehicles, such as distilled water, suitable for injection. 

Physiological saline, glucose, and other isotonic liquids, includiag adjuvants, such as 
D-sorbitol, D-mannnose, D-mannitol, and sodium chloride, can be used as aqueous solutions 

10 for injection. These can be used in conjunction with suitable solubilizers, such as alcohol, for 
example, ethanol; polyalcohols, such as propylene glycol and polyethylene glycol; and non- 
ionic surfactants, such as Polysorbate 80 (TM) and HCO-50. 

Sesame oil or soy-bean oil can be used as an oleaginous liquid, may be used in 
conjunction with benzyl benzoate or benzyl alcohol as a solubilizer, and may be formulated 

15 with a buffer, such as phosphate buffer and sodium acetate buffer; a pain-killer, such as 

procaine hydrochloride; a stabilizer, such as benzyl alcohol and phenol; and/or an anti-oxidant 
A prepared injection may be filled into a suitable ampoule. 

Methods well known to those skilled in the art may be used to administer the 
pharmaceutical composition of the present invention to patients, for example as an 

20 intraarterial, intravenous, or percutaneous injection or as an intranasal, transbronchial, 
intramuscular or oral administration. The dosage and method of administration vary 
according to the body-weight and age of a patient and the administration method; however, 
one skilled m the art can routinely select a suitable method of aduMnistration. If said 
compound is encodable by a DNA, the DNA can be inserted into a vector for gene therapy 

25 and the vector administered to a patient to perform the therapy. The dosage and method of 
administration vary according to the body-weight, age, and symptoms of the patient; however, 
one skilled in the art can suitably select them. 

For example, although the dose of a compound that binds to a protein of the present 
invention and regulates its activity depends on the symptoms, the dose is generally about 0.1 

30 mg to about 1 GO mg per day, preferably about 1 .0 mg to about 50 mg per day and more 

preferably about 1 .0 mg to about 20 mg per day, when administered orally to a normal adult 
human (weight 60 kg). 
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When administering the compound parenterally, in tiie forai of an injection to a 
noraial adult human (weight 60 kg), although there are some differences according to the 
patient, target organ, symptoms and method of administration, it is convenient to 
intravenously inject a dose of about 0.01 mg to about 30 mg per day, preferably about 0,1 to 
5 about 20 mg per day and more preferably about 0. 1 to about 1 0 mg per day. In the case of 
other animals, the appropriate dosage amount may be routinely calculated by converting to 60 
kgs of body- weight 

Assessinz the prosnosis of a subject with breast cancer: 

10 The present invention also provides a method of assessing the prognosis of a subject 

with BRC including the step of comparing the expression of one or more BRC-associated 
genes in a test cell population to the expression of the BRC-associated genes A5657, B9769 
and C7965 in a reference cell population derived from patients over a spectrum of disease 
stages. By comparing the gene expression of one or more BRC-associated genes in the test 

15 cell population and the reference cell population(s), or by comparing the pattern of gene 
expression over time m test cell populations derived from the subject, the prognosis of the 
subject can be assessed. 

For example, an increase in the expression of one or more of the BRC-associated 
genes A5657, B9769 and C7965 as compared to a noraial control indicates less favorable 

20 prognosis. Conversely, a similarity to the expression of one or more of the BRC-associated 
genes A5657, B9769 and C7965 as compared to normal control indicates a more favorable 
prognosis for the subject. Preferably, the prognosis of a subject can be assessed by comparing 
the expression profile of gene selected form group consisting of the BRC-associated genes 
A5657, B9769 and C7965, The classification score (CS) may be use for the comparing the 

25 expression profile. 

Kits: 

The present invention also includes a BRC-detection reagent, e.g., a nucleic acid that 
specifically binds to or identifies one or more BRC nucleic acids, such as oligonucleotide 
30 sequences which are complementary to a portion of a BRC nucleic acid, or an antibody that 
bind to one or more proteins encoded by a BRC nucleic acid. The detection reagents may be 
packaged together in the form of a kit. For example, the detection reagents may be packaged 



wo 2005/029067 



PCT/JP2004/011741 



-42- 

in separate containers, e.g., a nucleic acid or antibody (either bound to a solid matrix or 
packaged separately with reagents for binding them to the matrix), a control reagent (positive 
and/or negative), and/or a detectable label. Instructions (e.g., written, tape, VCR, CD-ROM, 
etc.) for carrying out the assay may also be included in the kit. The assay format of the kit 

5 may be a Northern hybridization or a sandwich ELIS A, both of which are known in the art. 

For example, a BRC detection reagent may be immobilized on a solid matrix, such as 
a porous strip, to form at least one BRC detection site. The measurement or detection region 
of the porous strip may include a plurality of sites, each containing a nucleic acid. A test strip 
may also contain sites for negative and/or positive controls. Alternatively, control sites may 

10 be located on a separate strip from the test strip. Optionally, the different detection sites may 
contain different amounts of immobilized nucleic acids, i.e., a higher amoimt in the first 
detection site and lesser amoxmts in subsequent sites. Upon the addition of test sample, the 
number of sites displaying a detectable signal provides a quantitative indication of the amount 
of BRC present in the sample. The detection sites may be configured in any suitably 

15 detectable shape and are typically in the shape of a bar or dot spanning the width of a test strip. 
Alternatively, the kit may contain a nucleic acid substrate array comprising one or 
more nucleic acids. The nucleic acids on the array specifically identify one or more nucleic 
acid sequences represented by the BRC-associated genes -45557, B9769 and C7965. The 
expression of 2 or more of the nucleic acids represented by the BRC-associated genes A5657, 

20 B9769 and C7965 may be identified by virtue of the level of binding to an array test strip or 
chip. The substrate array can be on, cg.^ a solid substrate, such as a "chip" described in U.S. 
Patent No.5,744,305, the contents of which are incorporated by reference herein in its entirety. 

Arrays and pluralities: 

25 The present invention also includes a nucleic acid substrate array comprising one or 

more nucleic acids. The nucleic acids on the array specifically correspond to one or more 
nucleic acid sequences represented by the BRC-associated genes A5657, B9769 and C7965. 
The level of expression of 2 or more of the nucleic acids represented by the BRC-associated 
genes A5657, B9769 and C7965 may be identified by detecting nucleic acid binding to the 

30 array. 

The present invention also includes an isolated plurality (z.e., a mixture of two or more 
nucleic acids) of nucleic acids. The nucleic acids may be in a liquid phase or a solid phase, 
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e.g., immobilized on a solid support such as a nitrocellulose membrane. The plurality 
includes one or more of the nucleic acids represented by the BRC-associated genes A5657, 
B9769 and C7965. In various embodiments, the plurality includes 2 or more of the nucleic 
acids represented by the BRC-associated genes listed mA5657, B9769 and C7965. 

5 

Methods of inhibiting breast cancer: 

The present invention further provides a method for treating or alleviating a symptom 
of BRC in a subject by decreasing the expression of one or more of the up-regulated BRC- 
associated genes A5657, B9769 and C7965 (or the activity of its gene product). Suitable 

10 therapeutic compounds can be administered prophylactically or therapeutically to a subject 
suffering from or at risk of (or susceptible to) developing BRC. Such subjects can be 
identified using standard clinical methods or by detecting an aberrant level of expression of 
one or more BRC-associated genes ^5557, B9769 and C7965 or aberrant activity of its gene 
product. In the context of the present invention, suitable therapeutic agents mclude, for 

15 example, inhibitors of cell cycle regulation, cell proliferation, and protein kinase activity. 

The therapeutic method of the present invention may include the step of decreasing 
the expression, function, or both, of one or more gene products of genes whose expression is 
aberrantly increased ("up-reg\ilated" or "over-expressed" gene) in breast cells. Expression 
may be inhibited in any of several ways known in the art. For example, expression can be 

20 inhibited by administering to the subject a nucleic acid that inhibits, or antagonizes, the 
expression of the over-expressed gene or genes, an antisense oligonucleotide or small 
interfering RNA which dismpts expression of the over-expressed gene or genes. 
Antisense Nucleic Acids: 

As noted above, antisense nucleic acids corresponding to the nucleotide sequence of 
25 the BRC-associated genes A5657, B9769 and C7965 can be used to reduce the expression 

level of the genes. Antisense nucleic acids corresponding to the BRC-associated genes A5657, 
B9769 and C7965 that are up-regulated in breast cancer are useful for the treatment of breast 
cancer. Specifically, the antisense nucleic acids of the present invention may act by binding 
to the BRC-associated genes A5657, B9769 and C7965^ or mRNAs corresponding thereto, 
30 thereby inhibiting the transcription or translation of the genes, promoting the degradation of 
the mRNAs, and/or inhibiting the expression of proteins encoded by the BRC-associated 
genes A5657, B9769 and C7965, thereby, inhibiting the function of the proteins. The term 
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"antisense nucleic acids" as used herein encompasses both nucleotides that are entirely 
complementary to the target sequence and tiiose having a mismatch of one or more 
nucleotides, so long as the antisense nucleic acids can specifically hybridize to the target 
sequences. For example, the antisense nucleic acids of the present invention include 
5 polynucleotides that have a homology of at least 70% or higher, preferably at least 80% or 
higher, more preferably at least 90% or higher, even more preferably at least 95% or higher 
over a span of at least 15 continuous nucleotides. Algorithms known in the art can be used to 
determmethe homology. 

The antisense nucleic acid of the present invention act on cells producing the proteuis 
10 encoded by BRC-associated marker genes A5657, B9769 and C7965 by bmdmg to the DNAs 
or mRNAs encoding the proteins, inhibiting their transcription or translation, promoting the 
degradation of the mRNAs, and inhibiting the expression of the protems, thereby resulting in 
the inhibition of the protein function. 

An antisense nucleic acid of the present invention can be made into an external 
15 preparation, such as a liniment or a poultice, by admixing it with a suitable base material 
which is inactive against the nucleic acid. 

Also, as needed, the antisense nucleic acids of the present invention can be formulated 
into tablets, powders, granules, capsules, liposome capsules, injections, solutions, nose-drops 
and freeze-drying agents by adding excipients, isotonic agents, solubilizers, stabilizers, 
20 preservatives, pain-killers, and such. These can be prepared by following known methods. 

The antisense nucleic acids of the present invention can be given to the patient by 
direct application onto the ailing site or by injection into a blood vessel so that it will reach 
the site of aihnent. An antisense-mounting medium can also be used to increase durability 
and membrane-permeability. Examples include, but are not limited to, liposomes, poly-L- 
25 lysine, lipids, cholesterol, lipofectin or derivatives of these. 

The dosage of the antisense nucleic acid of the present invention can be adjusted 
suitably according to the patient's condition and used in desired amounts. For example, a 
dose range of 0.1 to 100 mg/kg, preferably 0.1 to 50 mg/kg can be administered. 

The antisense nucleic acids of the present invention inhibit the expression of a protein 
30 of the present invention and are thereby useful for suppressing the biological activity of the 
protein of the invention. In addition, expression-inhibitors, comprising antisense nucleic acids 
of the present invention, are useful in that they can inhibit the biological activity of a protein 
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of the present invention. 

The method of the present invention can be used to alter the expression in a cell of an 
up-regulated BRC-associated gene, e.g., up-regulation resulting from the malignant 
transformation of the cells. Binding of the siRNA to a transcript corresponding to one of the 
5 BRC-associated genes A5657, B9769 or C7965 in the target cell results in a reduction in the 
protein production by the cell. The length of the oligonucleotide is at least 10 nucleotides and 
may be as long as the naturally-occurring transcript. Preferably, the oligonucleotide is 19-25 
nucleotides in length. Most preferably, the oligonucleotide is less than 75, 50, 25 nucleotides 
in length. 

10 The antisense nucleic acids of present invention include modified oligonucleotides. 

For example, thioated oligonucleotides may be used to confer nuclease resistance to an 
oligonucleotide. 

Also, an siRNA against a marker gene can be used to reduce the expression level of 
the marker gene. Herein, term "siRNA" refers to a double stranded RNA molecule which 

IS prevents translation of a target mRNA. Standard techniques for introducing siRNA into the 
cell may be used, including those in which DNA is a template from vMch RNA is transcribed. 
In the context of the present invention, the siRNA comprises a sense nucleic acid sequence 
and an anti-sense nucleic acid sequence against an up-regulated marker gene, such as the 
BRC-associated genes A5657, B9769 and C7965. The siRNA is constructed such that a 

20 single transcript has both the sense and complementary antisense sequences from the target 
gene, e.g., a hairpin. 

siRNA of A5657, B9769 or C7965, which hybridize to target mRNA, decrease or 
inhibit production of the A5657, B9769 and C7965 polypeptides encoded by the A5657, 
B9769 and C7965 genes, respectively, by associating with the normally single-stranded 

25 mRNA transcript, thereby interfering wdth translation and thus, expression of the protein. The 
siRNA is preferably less than 500, 200, 100, 50, or 25 nucleotides in length. More preferably 
the siRNA is 19-25 nucleotides in length. Exemplary nucleic acid sequence for the production 
of A5657, B9769 and C7965 siRNA include the sequences of nucleotides of SEQ ID NOs: 28, 
29, 30, 3 1, 32, 33, and 34 as the target sequence. Furthermore, in order to enhance the 

30 inhibition activity of the siRNA, nucleotide 'V can be added to 3 'end of the antisense strand 
of the target sequence. The number of *Vs to be added is at least 2, generally 2 to 10, 
preferably 2 to 5. The added "u"s form single strand at the 3 'end of the antisense strand of 
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the siRNA. 

An siRNA of A5657, B9769 or C7965 can be directly introduced into the cells in a 
form that is capable of binding to the mRNA transcripts. Alternatively, a DNA encoding the 
siRNA of A5657, B9769 or C7965 may be carried in a vector. 

5 Vectors may be produced, for example, by cloning an A5657, B9769 or C7965 target 

sequence into an expression vector operatively-linked regulatory sequences flanking the 
A5657, B9769 or C7965 sequence in a manner that allows for expression (by transcription of 
the DNA molecule) of both strands (Lee, N.S., Dohjima, T., Bauer, G., Li, H., Li, M.-J., 
Ehsani, A.,Salvaterra, P., and Rossi, J. (2002) Expression of small interfering RNAs targeted 

10 against HIV-1 rev transcripts in human cells. Nature Biotechnology 20 : 500-505.). An RNA 
molecule that is antisense to A5657, B9769 or C7965 mRNA is transcribed by a first 
promoter (e.^., a promoter sequence 3' of the cloned DNA) and an RNA molecule that is the 
sense strand for the A5657, B9769 or C7965 mRNA is transcribed by a second promoter (e.g., 
a promoter sequence 5' of the cloned DNA). The sense and antisense strands hybridize in 

15 vivo to generate siRNA constracts for silencing of the A5657, B9769 or C7965 gene. 

Alternatively, the two constructs can be utilized to create the sense and anti-sense strands of a 
siRNA construct. Cloned A5657, B9769 or C7965 can encode a construct having secondary 
structure, e.g., hairpins, wherein a single transcript has both the sense and complementary 
antisense sequences from the target gene. 

20 A loop sequence consisting of an arbitrary nucleotide sequence can be located 

between the sense and antisense sequence in order to form the hairpin loop structure. Thus, 
the present invention also provides siRNA having the general formula 5'-[A]-[B]-[A']-3% 
wherem [A] is a ribonucleotide sequence corresponding to a sequence selected from the group 
consisting of nucleotides of SEQ ID NOs: 28, 29, 30, 31, 32, 33, and 34, 

25 [B] is a ribonucleotide sequence consisting of 3 to 23 nucleotides, and 

[A'] is a ribonucleotide sequence consisting of the complementary sequence of [A]. 
The region [A] hybridizes to [A'], and then a loop consisting of region [B] is formed. The 
loop sequence may be preferably 3 to 23 nucleotide in length. The loop sequence, for 
example, can be selected from group consisting of following sequences 

30 (http://www.ambion.com/techlib/tb/tb_506.html). Furthermore, loop sequence consisting of 
23 nucleotides also provides active siRNA (Jacque, J.-M., Triques, K., and Stevenson, M. 
(2002) Modulation of HIV-1 replication by RNA interference. Nature 418 : 435-438.). 
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CCC, CCACC or CCACACC: Jacque, J. M, Triques, K., and Stevenson, M (2002) 
Modulation of HIV-1 replication by RNA interference. Nature, Vol. 418: 435-438, 

UUCG: Lee, N.S,, Dohjima, T., Bauer, G., Li, H., Li, M.-J., Ehsani, A., Salvaterra, P., 
and Rossi, J. (2002) Expression of small interfering RNAs targeted against HTV-l rev 
5 transcripts in human cells. Nature Biotechnology 20 : 500-505. Fruscoloni, P., Zamboni, M,, 
and Tocchini-Valentini, G. P. (2003) Exonucleolj^ic degradation of double-stranded RNA by 
an activity in Xenopus laevis germinal vesicles. Proc. Natl. Acad. Sci. USA 100(4): 1639- 
1644. 

UUCAAGAGA: Dykxhoom, D. M., Novina, C. D., and Sharp, P. A. (2002) Killing 
10 the messenger: Short RNAs that silence gene expression. Nature Reviews Molecular Cell 
Biology 4: 457-467. 

For example, preferable siRNAs having hairpin loop structure of the present invention 
are shown below. In the following structure, the loop sequence can be selected from group 
consisting of, CCC, UUCG, CCACC, CCACACC, and UUCAAGAGA. Preferable loop 
15 sequence is UUCAAGAGA Cttcaagaga" in DNA). Exemplary hairpin siRNA suitable for 
use in the context of the present invention include: 
forA5657-siRNA: 

caucgcaacuguguugacc-[b]-ggucaacacaguugcgaug (for target sequence of SEQ ID NO: 28); 
and 

20 ugccagacaguggacagag-[b]-cucuguccacugucuggca (for target sequence of SEQ ID NO: 29). 
forB9769-siRNA: 

gccugcaguuccugcagca-[b]-ugcugcaggaacugcaggc (for target sequence of SEQ ID NO: 30); 
gcuuccagucugucaaguc-[b]-gacuugacagacuggaagc (for target sequence of SEQ ID NO: 31); 
and 

25 agcagaggccucuaaugca-[b]-ugcauuagaggccucugcu (for target sequence of SEQ ID NO: 32). 
forC7965-siRNA: 

acugcuccucucagcuucc-[b]-ggaagcugagaggagcagu (for target sequence of SEQ ID NO: 33); 
and 

guacgcuuacuggcaucaa-[b]-uugaugccaguaagcguac (for target sequence of SEQ ID NO: 34) . 
30 The nucleotide sequence of suitable siRNAs can be designed using an siRNA design 

computer program available from the Ambion website (http://www.ambion.com/techlib/ 
misc/siRNA_finder.html). The computer program selects nucleotide sequences for siRNA 
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synthesis based on the following protocol. 

Selection of siRNA Target Sites : 
1. Beginning with the AUG start codon of the object transcript, scan downstream for AA 
dinucleotide sequences. Record the occurrence of each AA and the 3* adjacent 19 
5 nucleotides as potential siRNA target sites- Tuschl, et al. recommend against designing 

siRNA to the 5* and 3* imtranslated regions (UTRs) and regions near the start codon 
(within 75 bases) as these may be richer in regulatory protein bindiog sites, UTR-binding 
proteins and/or translation initiation complexes may interfere with binding of the siRNA 
endonuclease complex. 

10 2. Compare the potential target sites to the human genome database and eliminate from 

consideration any target sequences with significant homology to other coding sequences. 
The homology search can be performed usmg BLAST, which can be found on the NCBI 
server at: www.ncbi.nlm.nih.gov/BLAST/. 
3. Select qualifying target sequences for synthesis. At Ambion, preferably several target 
15 sequences can be selected along the length of the gene to evaluate. 

The regulatory sequences flanking the A5657, B9769 or C7965 sequences can be 
identical or different, such that their expression can be modulated independently, or in a 
temporal or spatial manner. siRNAs are transcribed intracellularly by cloning the A5657, 
B9769 or C7965 gene templates, respectively, into a vector containing, e.g., a RNA pol in 
20 transcription unit from the small nuclear RNA (snRNA) U6 or the human HI RNA promoter. 
For introducing the vector into the cell, transfection-enhancing agent can be used. FuGENE 
(Rochediagnostices), Lipofectamin 2000 (Invitrogen), Oligofectamin (Invitrogen), and 
Nucleofactor (Wako pure Chemical) are usefiil as the transfection-enhancing agent. 
The antisense oligonucleotide or siRNA of the present invention inhibits the 
25 expression of a polypeptide of the present invention and is thereby useful for suppressing the 
biological activity of a polypeptide of the invention. Also, expression-inhibitors, comprising 
the antisense oligonucleotide or siRNA of the invention, are useful in the point that they can 
inhibit the biological activity of the polypeptide of the invention. Therefore, a composition 
comprising an antisense oligonucleotide or siRNA of the present invention is useful for 
30 treating a breast cancer. 
Antibodies: 
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Altematively, function of one or more gene products of the genes over-expressed in 
BRC (e.g., A5657, B9769 or C7965) can be inhibited by administering a compound that binds 
to or otherwise inhibits the function of the gene products. For example, the compound may 
be an antibody which binds to the over-expressed gene product or gene products. 

5 The present invention refers above to the use of antibodies, particularly antibodies 

against a protein encoded by an up-regulated marker gene, or a fragment of such an antibody. 
As used herein, the term "antibody" refers to an immunoglobulin molecule having a specific 
structure, that mteracts (i.e., binds) only with the antigen that was used for synthesizing the 
antibody (i.e., the gene product of an up-regulated marker gene such as A5657, B9769 or 

10 C7965) or with an antigen closely related thereto. As noted above, in the context of the 

present invention, an antibody may be a fragment of an antibody or a modified antibody, so 
long as it binds to one or more of the proteins encoded by the marker genes. For instance, the 
antibody fragment may be Fab, F(ab')2, Fv, or single chain Fv (scFv), in which Fv fragments 
from H and L chains are ligated by an appropriate linker (Huston J. S. et al. Proc. Natl. Acad. 

15 Sci. U.S.A. 85:5879-5883 (1988)). More specifically, an antibody fragment may be generated 
by treating an antibody with an enzyme, such as papain or pepsin. Altematively, a gene 
encoding the antibody fragment may be constructed, inserted into an expression vector, and 
expressed in an appropriate host cell (see, for example, Co M. S. et al. J. Immunol. 152:2968- 
2976 (1994); Better M. and Horwitz A. H. Methods Enzymol. 178:476-496 (1989); Pluckthun 

20 A. and Skerra A. Methods Enzymol. 178:497-515 (1989); Lamoyi R Methods Enzymol. 
121:652-663 (1986); Rousseaux J. et al. Methods Enzymol. 121:663-669 (1986); Bird R. E. 
and Walker B. W. Trends Biotechnol. 9:132-137 (1991)). 

An antibody may be modified by conjugation with a variety of molecules, such as 
polyethylene glycol (PEG). The present invention provides such modified antibodies. The 

25 modified antibody can be obtained by chemically modifying an antibody. Such modification 
methods are conventional in the field. 

Altematively, an antibody may comprise a chimeric antibody having a variable region 
derived from a nonhuman antibody and a constant region derived from a human antibody, or a 
humanized antibody, comprising a complementarity determining region (CDR) derived from 

30 a nonhuman antibody, a frame work region (FR) and a constant region derived from a human 
antibody. Such antibodies can be prepared by using known technologies. 

CancCT therapies directed at specific molecular alterations that occur in cancer cells 
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have been validated through clinical development and regulatory approval of anti-cancer 
drugs such as trastuzumab (Herceptin) for the treatment of advanced breast cancer, imatinib 
methylate (Gleevec) for chronic myeloid leukemia, gefitinib (Iressa) for non-small cell lung 
cancer (NSCLC), and rituximab (anti-CD20 mAb) for B-cell lymphoma aad mantle cell 

5 lymphoma (Ciardiello F, Tortora G. A novel approach in the treatment of cancer: targeting the 
epidermal growth factor receptor, Clin Cancer Res. 2001 Oct;7(10):2958-70. Review.; 
Slamon DJ, Leyland- Jones B, Shak S, Fuchs H, Paton V, Bajamonde A, Fleming T, Eiermann 
W, Wolter J, Pegram M, Baselga J, Norton L. Use of chemotherapy plus a monoclonal 
antibody against HER2 for metastatic breast cancer that overexpresses HER2. N Engl J Med. 

10 2001 Mar 15;344(ll):783-92.; Rehwald U, Schulz H, Reiser M, Sieber M, Staak JO, 

Morschhauser F, Driessen C, Rudiger T, MuUer-Heraielink K, Diehl V, Engert A. Treatment 
of relapsed CD20+ Hodgkin lymphoma with the monoclonal antibody rituximab is effective 
and well tolerated: results of a phase 2 trial of the German Hodgkin Lymphoma Study Group. 
Blood. 2003 Jan 15;101(2):420"424.; Fang G, Kim CN, Perkins CL, Ramadevi N, Winton E, 

15 Wittmann S and Bhalla KN. (2000). Blood, 96, 2246-2253.). These drugs are clmically 
effective and better tolerated than traditional anti-cancer agents because they target only 
transforaied cells. Hence, such drugs not only improve survival and quality of life for cancer 
patients, but also validate the concept of molecularly targeted cancer therapy. Furthermore, 
targeted drugs can enhance the eflScacy of standard chemotherapy when used in combination 

20 with it (Gianni L. (2002). Oncology, 63 Suppl 1, 47-56.; Klejman A, Rushen L, Morrione A, 
Slupianek A and Skorsld T. (2002). Oncogene, 21, 5868-5876.). Therefore, future cancer 
treatments will probably involve combining conventional drugs with target-specific agents 
aimed at different characteristics of tumor cells such as angiogenesis and invasiveness. 

These modulatory methods can be performed ex vivo or in vitro (e.g., by culturing the 

25 cell with the agent) or, altematively, in vivo (e.g., by administering the agent to a subject). 
The methods involve administering a protein pr combination of proteins or a nucleic acid 
molecule or combination of nucleic acid molecules as thenqpy to coiinteract aberrant 
expression of the differentially expressed genes or aberrant activity of their gene products. 
Diseases and disorders that are characterized by increased (relative to a subject not 

30 suffering from the disease or disorder) expression levels or biological activities of genes and 
gene products, respectively, may be treated with therapeutics that antagonize (i.e., reduce or 
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inhibit) activity of the over-expressed gene or genes. Therapeutics tiiat antagonize activity 
can be administered therapeutically or prophylactically. 

Accordingly, therapeutics that may be utilized in the context of the present invention 
include, e.g., (i) a polypeptide of the over-expressed gene or genes, or analogs, derivatives, 

5 fragments or homologs thereof; (ii) antibodies to the over-expressed gene or gene products; 
(iii) nucleic acids encoding the over-expressed gene or genes; (iv) antisense nucleic acids or 
nucleic acids that are ''dysfunctional'' (i.e., due to a heterologous insertion within the nucleic 
acids of one or more over-expressed gene or genes); (v) small interfering RNA (siRNA); or 
(vi) modulators (i.e., inhibitors, antagonists that alter the interaction between an over- 

10 expressed polypeptide and its binding partner). The dysfunctional antisense molecules are 
utilized to "knockout" endogenous function of a polypeptide by homologous recombination 
(see, e.g,, Capecchi, Science 244: 1288-1292 1989). 

Increased levels can be readily detected by quantifying peptide and/or RNA, by 
obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for RNA or 

15 peptide levels, structure and/or activity of the expressed peptides (or mRNAs of a gene whose 
expression is altered). Methods that are well-known within the art include, but are not limited 
to, immunoassays (e.g., by Western blot analysis, immimoprecipitation followed by sodium 
dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, immimocytochemistry, etc.) and/or 
hybridization assays to detect expression of mRNAs (e.g., Northem assays, dot blots, in situ 

20 hybridization, etc.). 

Prophylactic administration occurs prior to the manifestation of overt clinical 
symptoms of disease, such that a disease or disorder is prevented or, alternatively, delayed in 
its progression. 

25 

Vaccinating against breast cancer: 

The present invention also relates to a method of treating or preventing breast cancer 
in a subject comprising the step of administering to said subject a vaccine comprising a 
polypeptide encoded by a nucleic acid selected from the group consisting of the BRC- 
30 associated genes A5657, B9769 or C7965 , an immunologically active fragment of said 
polypeptide, or a polynucleotide encoding such a polypeptide or fragment thereof 
Administration of the polypeptide induces an anti-tumor immunity in a subject. To induce 
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anti-tumor immunity, a polypeptide encoded by a nucleic acid selected from the group 
consisting of the BRC-associated genes A5657, B9769 or C7965^ an immunologically active 
fragment of said polypeptide, or a polynucleotide encoding such a polypeptide or fragment 
thereof is administered to subject in need thereof The polypeptide or the immunologically 
5 active fragments thereof are usefiil as vaccines against BRC» In some cases, the proteins or 
fragments thereof may be administered in a form bound to the T cell receptor (TCR) or 
presented by an antigen presenting cell (APC), such as macrophage, dendritic cell (DC), or B- 
cells. Due to the strong antigen presenting ability of DC, the use of DC is most preferable 
among the APCs. 

10 In the present invention, a vaccine against BRC refers to a substance that has the 

ability to induce anti-tumor immunity upon inoculation into animals. According to the 
present invention, polypeptides encoded by the BRC-associated genes A5657, B9769 or 
C7965, or fragments thereof, were suggested to be HLA-A24 or HLA-A*0201 restricted 
epitopes peptides that may induce potent and specific immune response against BRC cells 

15 expressmg the BRC-associated genes A5657, B9769 or C7965. Thus, the present invention 
also encompasses a method of inducing anti-tumor immunity using the polypeptides. In 
general, anti-tumor immunity includes immune responses such as follows: 

- induction of cytotoxic lymphocytes against tumors, 

- induction of antibodies that recognize tumors, and 
20 - induction of anti-tumor cytokine production. 

Therefore, when a certain protein induces any one of these immune responses upon 
inoculation into an animal, the protein is determined to have anti-tumor immunity inducing 
effect The induction of the anti-tumor immunity by a protein can be detected by observing in 
vivo or in vitro the response of the immune system in the host against the protein. 

25 For example, a method for detecting the induction of cytotoxic T lymphocytes is well 

known. Specifically, a foreign substance that, enters the Uving body is presented to T cells 
and B cells by the action of antigen presenting cells (APCs). T cells that respond to the 
antigen presented by the APCs in an antigen specific manner differentiate into cytotoxic T 
cells (or cytotoxic T lymphocytes; CTLs) due to stimulation by the antigen, and then 

30 proliferate (this is referred to as activation of T cells). Therefore, CTL induction by a certain 
peptide can be evaluated by presenting the peptide to a T cell via an APC, and detecting the 
induction of CTLs. Furthermore, APCs have the effect of activating CD4+ T cells, CD8+ T 
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cells, macrophages, eosinophils, and NK cells. Since CD4+ T cells and CD8+ T cells are also 
important in anti-tumor inMnunity, the anti-tumor immunity-inducing action of the peptide 
can be evaluated using the activation effect of these cells as indicators. 

A method for evaluating the inducing action of CTLs using dendritic cells (DCs) as 

5 the APC is well known in the art. DCs are a representative APCs having the strongest CTL- 
inducing action among APCs. In this method, the test polypeptide is initially contacted with 
DCs, and then the DCs are contacted with T cells. Detection of T cells having cytotoxic 
effects against the cells of interest after the contact with DC shows that the test polypeptide 
has an activity of inducing the cytotoxic T cells. Activity of CTLs against tumors can be 

10 detected, for example, using the lysis of ^^Cr-labeled tumor cells as the indicator. 

Alternatively, the method of evaluating the degree of tumor cell damage using ^H-thymidine 
uptake activity or LDH (lactose dehydrogenase)-release as the indicator is also well known. 

Apart from DCs, peripheral blood mononuclear cells (PBMCs) may also be used as 
the APC. The induction of CTLs has been reported to be enhanced by culturing PBMCs in 

15 the presence of GM-CSF and IL-4. Similarly, CTLs have been shown to be induced by 
culturing PBMCs in the presence of keyhole limpet hemocyanin (KLH) and IL-7. 

Test polypeptides confirmed to possess CTL-inducing activity by these methods are 
deemed to be polypeptides having DC activation eflFect and subsequent CTL-inducing activity. 
Therefore, polypeptides that induce CTLs against tumor cells are useful as vaccines against 

20 tumors. Furthermore, APCs that have acquired the ability to induce CTLs against tumors 

through contact with the polypeptides are also useful as vaccines against tumors. Furthermore, 
CTLs that have acquired cytotoxicity due to presentation of the polypeptide antigens by APCs 
can be also be used as vaccines against tumors. Such therapeutic methods for tumors, using 
anti-tumor immunity due to APCs and CTLs, are referred to as cellular immunotherapy. 

25 Generally, when using a polypeptide for cellular immunotherapy, efficiency of the 

CTL-induction is known to be increased by combining a plurality of polypeptides having 
different structures and contacting them with DCs. Therefore, when stimulating DCs with 
protein fragments, it is advantageous to use a mixture of multiple types of fragments. 
Alternatively, the induction of anti-tumor immunity by a polypeptide can be 

30 confirmed by observing the induction of antibody production against tumors. For example, 
when antibodies against a polypeptide are induced in a laboratory animal immunized with the 
polypeptide, and when growth of tumor cells is suppressed by those antibodies, the 
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polypeptide is deemed to have the ability to induce anti-tumor immunity. 

Anti-tumor immunity is induced by administering the vaccine of this invention, and 
the induction of anti-tumor immunity enables treatment and prevention of BRC. Therapy 
against cancer or prevention of the onset of cancer includes any of the following steps, such 

5 as inhibition of the growth of cancerous cells, involution of cancer, and suppression of the 
occurrence of cancer. A decrease in mortality and morbidity of individuals having cancer, 
decrease in the levels of tumor maikers in the blood, alleviation of detectable symptoms 
accompanying cancer, and such are also included in the therapy or prevention of cancer. 
Such therapeutic and preventive efifects are preferably statistically significant. For example, 

10 in observation, at a significance level of 5% or less, wherein the therapeutic or preventive 
effect of a vaccine against cell proliferative diseases is compared to a control without vaccine 
adnnnistration. For example. Student's t-test, the Mann- Whitney U-test, or ANOVA may be 
used for statistical analysis. 

The above-mentioned protein having immunological activity or a vector encoding 

IS the protein may be combined vdth an adjuvant. An adjuvant refers to a compound that 
enhances the inmiune response against the protein when administered together (or 
successively) with the protem having immunological activity. Exemplary adjuvants include, 
but are not limited to, cholera toxin, salmonella toxin, alxmi, and such, but are not limited 
thereto. Furthermore, the vaccine of this invention may be combined appropriately with a 

20 pharmaceutically acceptable carrier. Examples of such carriers include sterilized water, 

physiological salme, phosphate buffer, culture fluid, and suck Furthermore, the vaccine may 
contain as necessary, stabilizers, suspensions, preservatives, surfactants, and such. The 
vaccine can be administered systemically or locally. Vaccine administration can be 
performed by single administration, or boosted by multiple administrations. 

25 When using an APC or CTL as the vaccine of this invention, tumors can be treated 

or prevented, for example, by the ex vivo method. More specifically, PBMCs of the subject 
receiving treatment or prevention are collected, the cells are contacted with the polypeptide ex 
vivo, and following the induction of APCs or CTLs, the cells may be administered to the 
subject. APCs can be also mduced by introducing a vector encoding the polypeptide mto 

30 PBMCs ex vivo. APCs or CTLs induced in vitro can be cloned prior to administration. By 
cloning and growing cells having high activity of damaging target cells, cellular 
immunotherapy can be performed more effectively. Furthermore, APCs and CTLs isolated in 
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tbis manner may be used for cellular immunotherapy not only against individuals from whom 
the cells are derived, but also against similar types of tumors from other individuals. 

Furthermore, a pharmaceutical composition for treating or preventing a cell 
proliferative disease, such as cancer, comprising a pharmaceutically effective amount of the 
5 polypeptide of the present invention is provided. The pharmaceutical composition may be 
used for raising anti tumor immunity. 

Pharmaceutical compositions for inhibitim BRC or malignant BRC: 

In the context of the present invention, suitable pharmaceutical formulations include 
those suitable for oral, rectal, nasal, topical (including buccal and sub-lingual), vaginal or 

10 parenteral (including intramuscular, subcutaneous and intravenous) administration, or for 
administration by inhalation or insiifflation. Preferably, administration is intravenous. The . 
formulations are optionally packaged in discrete dosage units. 

Pharmaceutical formulations suitable for oral administration include capsules, cachets 
or tablets, each containing a predetemiined amount of active ingredient Suitable 

15 formulations also include powders, granules, solutions, suspensions and emulsions. The active 
ingredient is optionally administered as a bolus electuary or paste. Tablets and capsules for 
oral administration may contain conventional excipients, such as binding agents, fillers, 
lubricants, disintegrant and/or wetting agents. A tablet may be made by compression or 
molding, optionally with one or more formulational ingredients. Compressed tablets may be 

20 prepared by compressing in a suitable machine the active ingredients in a free-flowing form, 
such as a powder or granules, optionally mixed with a binder, lubricant, inert diluent, 
lubricating, surface active and/or dispersing agent. Molded tablets may be made by molding 
in a suitable machine a mixture of the powdered compound moistened with an inert liquid 
diluent. The tablets may be coated according to methods well known in the art Oral fluid 

25 preparations may be in the form of, for example, aqueous or oily suspensions, solutions, 

emulsions, syrups or elixirs, or may be presented as a dry product for constitution with water 
or other suitable vehicle before use. Such liquid preparations may contain conventional 
additives, such as suspending agents, emulsifying agents, non-aqueous vehicles (which may 
include edible oils), and/or preservatives. The tablets may optionally be formulated so as to 

30 provide slow or controlled release of the active ingredient therein. A package of tablets may 
contain one tablet to be taken on each of the month. 
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Formulations suitable for parenteral admixiistration include aqueous and non-aqueous 
sterile injection solutions, optionally contain anti-oxidants, buffers, bacteriostats and solutes 
which render the formulation isotonic with the blood of the intended recipient; as well as 
aqueous and non-aqueous sterile suspensions including suspending agents and/or thickening 

5 agents. The formulations may be presented in imit dose or multi-dose containers, for example 
as sealed ampoules and vials, and may be stored in a freeze-dried (lyophilized) condition, 
requiring only the addition of the sterile liquid carrier, for example, saline, water-for-injection, 
immediately prior to use. Alternatively, the formulations may be presented for continuous 
infusion. Extemporaneous injection solutions and suspensions may be prepared from sterile 

10 powders, granules and tablets of the kind previously described. 

Formulations suitable for rectal administration include suppositories with standard 
carriers such as cocoa butter or polyethylene glycol. Formulations suitable for topical 
administration in the mouth, for example, buccally or sublingually, include lozenges, 
containing the active ingredient in a flavored base such as sucrose and acacia or tragacanth, 

IS and pastilles, comprising the active ingredient in a base such as gelatin and glycerin or 
sucrose and acacia. For intra-nasal administration, the compounds of the invention may be 
used as a liquid spray, a dispersible powder, or in the form of drops. Drops may be 
formulated with an aqueous or non-aqueous base also comprising one or more dispersing 
agents, solubilizing agents and/or suspending agents. 

20 For administration by inhalation the compounds can be convenientiy delivered from 

an insufQator, nebulizer, pressurized packs or other convenient means of delivering an aerosol 
spray. Pressurized packs may comprise a suitable propellant such as dichlorodifluoromethane, 
trichlorofluoromethane, dichiorotetrafluoroethane, carbon dioxide or other suitable gas. In 
the case of a pressurized aerosol, the dosage unit may be determined by providing a valve to 

25 deliver a metered amount. 

Alternatively, for administration by inhalation or insufflation, the compoimds may 
take the form of a dry powder composition, for example a powder mix of the compound and a 
suitable powder base, such as lactose or starch. The powder composition may be presented in 
unit dosage form, for example, as capsules, cartridges, gelatin or blister packs, from which the 

30 powder may be administered with the aid of an inhalator or insufflators. 

Other formulations include implantable devices and adhesive patches which release a 
therapeutic agent. 
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When desired, the above described foimulations, adapted to give sustained release of 
the active ingredient, may be employed. The pharmaceutical compositions may also contain 
oiher active ingredients, such as antimicrobial agents, immunosuppressants and/or 
preservatives. 

5 It should be understood that in addition to the ingredients particularly mentioned 

above, the formulations of this invention may include other agents conventional in the art 
with regard to the type of formulation in question- For example, formulations suitable for oral 
adnoJnistration may include flavoring agents. 

Preferred unit dosage formulations contain an effective dose, as recited below, or an 
10 appropriate fraction thereof, of the active ingredient. 

For each of the aforementioned conditions, the compositions, e.g., polypeptides and 
organic compoxmds, can be administered orally or via injection at a dose ranging from about 
0.1 to about 250 mg/kg per day. The dose range for adult humans is generally from about 5 
mg to about 17.5 g/day, preferably about 5 mg to about 10 g/day, and most preferably about 
15 100 mg to about 3 g/day. Tablets or other unit dosage forms of presentation provided in 
discrete units may conveniently contain an amoimt which is effective at such dosage or as a 
multiple of the same, for instance, imits containing about 5 mg to about 500 mg, usually from 
about 1 00 mg to about 500 mg. 

The dose employed will depend upon a nimiber of factors, including the age and sex 
20 of the subject, the precise disorder being treated, and its severity. Also the route of 
administration may vary depending upon the condition and its severity. In any event, 
appropriate and optimum dosages may be routinely calculated by those skilled m the art, 
taking into consideration the above-mentioned factors. 

Aspects of the present invention are described in the following examples, which are 
25 not intended to limit the scope of the invention described in the claims. The following 

examples illustrate the identification and characteri2ation of genes differentially expressed in 
BRC cells. 



Examples 



30 



Tissue obtained from diseased tissue (e.g., epithelial cells from BRC) and normal 
tissues was evaluated to identify genes which are differently expressed or a disease state, e.g., 



wo 2005/029067 



PCT/JP2004/011741 



-58- 

BRC. The assays were carried out as follows. 
Patients and tissue samples: 

Primary breast cancers were obtained with informed consent fix)m 81 patients (12 ductal 
5 carcinoma in situ and 69 invasive ductal carcinoma from 2 cm to 5 cm(T2), median age 45 in 
a range of 21 to 68 years old) who treated at Department of Breast Surgery, Cancer Institute 
Hospital, Tokyo, Japan, concerning which all patients had given informed consent (Table 2). 
Clinical information was obtained from medical records and each tumor was diagnosed 
according to histopathological subtype and grade by pathologists. Tumor tissue was used to 
10 evaluate tumor type (according to the World Health Organization classification and the 
Japanese cancer society classification). Clinical stage was judged according to the JBCS 
TNM classification. No significant differences were observed between node-positive and 
node-negative cases. The presence of angioinvasive growth and extensive lymphocytic 
infiltrate was detemiined by pathologists. Estrogen receptor (BR) and progesterone receptor 
15 (PgR) expression was determined by EIA (ER negative when less than 13finol/mg protein, 
BML). A mixture of normal breast ductal cells from the 15 premenopausal patients with 
breast cancer or the 12 postmenopausal patients were used as normal controls, respectively. 
All samples were immediately frozen and stored at -80°C. 

Tissue Samples and LMM: 

20 Cliiiicd and pathological mforination on the tumor is detailed in Table 2. Samples 

were embedded in TissueTek OCT medium (Sakura) and then stored at -80°C until use. 
Frozen specimens were serially sectioned in S-^xm slices with a cryostat and stained with 
hematoxylin and eosin to define the analyzed regions. To avoid cross-contamination of 
cancer and noncancerous cells, these two populations were prepared by EZ Cut LMM System 

25 (SL Microtest GmbH) followed the manufacture's protocol with several modifications. To 
minimize the eflFects during storage process and tissue collection, the cancer tissues were 
carefiiUy handled by the same procedure. To check the quality of RNAs, total RNA extracted 
from the residual tissue of each case were electrophoresed under the degenerative agarose gel, 
and confirmed their quality by a presence of ribosomal RNA bands. 

30 

Cell lines: 

Human-breast cancer cell lines HBC4, HBC5, MDA-MB-231, BSY-1 were kindly 
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provided by Dr. Yamori (The j£5)anese Foundation of Cancer Research, Tolqro), MCF7, 
T47D, SKBR3, HCC1937, MDA-MB-435S, YMBl, HBLIOO, COS?, NIH3T3 were obtained 
j&om ATCC. All cells were cultured in appropriate media; i.e. RPMI-1640 (Sigma, St Louis, 
MO) for HBC4, HBC5, SKBR3, T47D, YMBl, and HCC1937 (with 2mM L-glutamine); 

5 Dulbecco's modified Eagle's medium (Invitrogen, Carlsbad, CA) for HBLIOO, C0S7, 

NIH3T3; EMEM (Sigma) with O.lmM essential amino acid ^oche), ImM sodium pyruvate 
(Roche), O.Olmg/ml Insulin(Sigma) for MCM7; L-15 (Roche) for MDA-MB-231 and MDA- 
MB-435S. Each medium was supplemented with 10% fetal bovine serum (Cansera) and 1% 
antibiotic/antimycotic solution (Sigma). MDA-MB-231 and MDA-MB-435S cells were 

10 maintained at 37°C an atmosphere ofhumidified air without CO2. Other cell lines were 
maintained at 37°C an atmosphere of humidified air with 5% CO2. 

RNA Extraction and T7-Based RNA Amplification: 

Total RNA was extracted from each population of laser captured cells into 3S0)xl 

15 RLT lysis buffer (QIAGEN). The extracted RNA was treated for 30 minutes at room 

temperature with 30 units of DNase I (QL^GEN). After inactivation at 70*'C for 10 min, the 
RNAs were purified with an RNeasy Mini Kit (QIAGEN) according to the manufacturer's 
recommendations. All of the DNase I treated RNA was subjected to T7-based amplification 
using Ampliscribe T7 Transcription Kit (Epicentre Technologies). Two rounds of 

20 amplification yielded 28.8-329.4 \ig of amplified RNAs (aRNAs) for each sample, whereas 
when RNAs firom normal samples firom IS premenopausal patients or 12 postmenopausal 
patients were amplified, total of 2240.2ng and 2023.8ng were yielded, respectively. 2.5fig 
aliquots of aRNA 60m. each cancerous cells and noncancerous breast ductal cells were 
reverse-transcribed in the presence of Cy5-dCTP and Cy3-dCTP (Amersham Biosciences), 

25 respectively. 

cDNA microarrays: 

A "genome-wide" cDNA microairay system was established containing 23,040 
cDNAs selected from, the UniGene database (build #13 1) the National Center for 
30 Biotechnology Information (NCBI). Fabrication of the cDNA microarray sUdes has been 
described elsewhere (Ono K, Tanaka T, Tsunoda T, Kitahara 0, Kihara C, Okamoto A, 
Ochiai K, Katagiri T and Nakamura Y. Identification by cDNA Microarray of Genes Involved 
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in Ovarian Carcinogenesis. Cancer Res., 60, 5007-1 1, 2000.). Briefly, the cDNAs were 
amplified by reverse transcription-PCR using poly(A)+RNA isolated from various human 
organs as templates; lengths of the amplicons ranged from 200 to 1 100 bp without repetitive 
or poly(A) sequences. The PGR products were spotted in duplicate on type-7 glass slides 
5 (Amersham Bioscience) using a Lucidea Array Spotter (Amersham Biosciences); 4,608 or 
9,216 genes were spotted in duplicate on a smgle slide. Three different sets of slides (total 
23,040 genes) were prepared, on each of which the same 52 housekeeping genes and two 
kinds of negative-control genes were spotted as well. 

10 Hybridization and Acquisition of Data: 

Hybridization and washing were performed according to protocols described 
previously except that all processes were carried out with an Automated Slide Processor 
(Amersham Biosciences) (Giuliani, N., et al., V. Human myeloma cells stimulate the receptor 
activator of nuclear factor-kappa B ligand (RANKL) m T lymphocytes: a potential role in 

15 multiple myeloma bone disease. Blood, 100: 4615-4621, 2002.). The intensity of each 
hybridization signal was calculated photometrically by the ArrayVision computer program 
(Amersham Biosciences) and background intensity was subtracted. The fluorescence 
intensities of Cy5 (tumor) and Cy3 (control) for each target spot were adjusted so that the 
mean Cy5/Cy3 ratio was performed using averaged signals from the 52 housekeeping genes. 

20 Because data derived from low signal intensities are less reliable,a cut-off value on each slide 
was determined as described previously (Ono, K., et al.,Identification by cDNA microarray of 
genes involved in ovarian carcinogenesis. Cancer Res, 60: 5007-501 1, 2000.) excluded genes 
from fiirther analysis when both Cy3 and Cy5 dyes yielded signal intensities lower than the 
cut-off (Saito-Hisaminato, A., Katagiri, T., Kakiuchi, S., Nakamura, T., Tsunoda, T., and 

25 Nakamura, Y. Genome-wide profiling of gene expression in 29 normal human tissues with a 
cDNA microarray. DNA Res, 9: 35-45, 2002.). For other genes, the Cy5/Cy3 ratio was 
calculated using the raw data of each sample. 



30 



Identification and isolation of a novel human genes A5657, B9769 and C7965: 

Total RNAs were extracted and amplified as discussed above. Aliquots of amplified 
RNA from breast cancer cells and the normal breast ductal cells were labeled by reverse 
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transcription witii Cy5-dCTP and Cy3-dCTP, respectively (Amersham Biosciences, 
Buckinghamshire, UK). Hybridization, washing, and detection were carried out as described 
above. To detect genes that were commonly up-regulated in breast cancer, the overall 
expression patterns of the 23,040 genes on the microarray were screened to select those with 

5 expression ratios >3.0 that were present in >50% of i) all of 81 breast cancer cases, ii) 69 
invasive ductal carcinomas, iii) 3 1 well-, iv) 14 moderately, or v) 24 poorly-differentiated 
lesions, respectively. Among the total of 102 genes that appeared to up-regulated in tumor 
cells, the three with the following in-house identification number, A5657, B9769 and C7965 
were selected for further exanodnation because their expression ratio were greater than 3.0 in 

10 more than 50% of the informative breast cancer cases. 

Semi-quantitative RT-PCR: 

The three up-regulated genes, A5657, B9769 and C7965^ mentioned above were selected and 
flieir expression levels were examined by applying the semi-quantitative RT-PCR 

15 experiments. Specifically, total RNA were extracted, amplified and reverse transcribed as 
described above. Appropriate dilutions of each single-stranded cDNA were prepared for 
subsequent PGR amplification by monitoring the glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) as a quantitative intemal control. The PGR primer sequences were 
5'- CGACCACTTTGTGAAGCTGA-3'(SEQ ID No.7) and 

20 5'- GGTTGAGCACAGGGTACTTTATT-3XSEQ ID No.8) for GAPDH; 
5'- CAAATATTAGGTGGAGCCAACAC-3' (SEQ IDNo.9) and 
5'- TAGATCACCTTGGCAAAGAACAC-3' (SEQ IDNo.lO) for ^5557, 
5'-ACCTCAAGTGCCTCCTGGAA-3' (SEQ ID No.ll) and 
5'-TCAGTTTCAAGAGGTAAGGGGAT-3' (SEQ ID No.l2) for B9769, 

25 5'- AGAGGGATAGAAAGTGCTGCTCT-3' (SEQ ID No.l3) and 

5'- CATAACTGCATAGACAGCACGTC-3'.(SEQ ID No.l4) for C7965. 

Northern-hlot analysis: 

Total RNAs were extracted from all breast cancer cell lines using RNeasy kit 
(QIAGEN) according to the manufacturer's instructions. After treatment with DNase I 
30 (Nippon Gene, Osaka, Japan), mRNA was isolated with mRNA purification kit (Amersham 
Biosciences) following the manufacturer's mstructions. A 1-p.g aliquot of each mRNA, along 
with polyA(+) RNAs isolated fi'om normal adult human breast (Biochain), lung, heart, liver, 
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kidney, bone marrow (BD, Clontech, Palo Alto, CA), were separated on 1% denaturing 
agarose gels and transferred to nylon membranes (Breast cancer-Northern blots). Breast 
cancer- and Human multiple-tissue Northern blots (Clontech, Palo Alto, CA) were hybridized 
with an [a^^P]-dCTP-labeled PGR products of A5657. B9769 and C7965 prepared by RT-PCR 

5 (see below). Pre-hybridization, hybridization and washing were performed according to the 
sxq)plier's recommendations. The blots were autoradiographed with intensifying screens at - 
80"C for 14 days. Specific probes iotA5657 (541bp), B9769 (499bp) and C7965 (238bp) 
were prepared by PGR using a primer set as mentioned in semi-quantitative RT-PCR section; 
5'- CAAATATTAGGTGGAGCCAACAC-3' (SEQ IDNo.9) and 

10 5'- TAGATCACCTTGGCAAAGAACAC-3' (SEQ ID No.lO) for ^5557, 
5'-ACCTCAAGTCCCTCCTGGAA-3' (SEQ ID No.ll) and 
5'-TCAGTTTCAACAGGTAAGGCGAT-3' (SEQ ID No. 12) for 5'- 
GGGAAGAGAAGTCCCGAGTC-3' (SEQ ID No.l5) and 
5'- TCCTTATTCTGAATTTCCAGAATC-3' (SEQ ID No.16) for C7965. 

15 

Rapid AmvUncation cDNA End (RACE)-PCR: 

To detect the full length of transcripts of C7965, 5' RACE was performed with a 
Marathon cDNA amplification kit (BD, CLONTECH) following the manufacturer's 
instruction. When 1^* PCR was performed with reverse primers (5'- 

20 CAAGCAGTCCTACCAGGGTTCGGAAGCTGA-3') (SEQ ID No.l7) using cDNA 

prepared from the breast cancer cell line, MCF7, as a template, multiple product bands were 
detected. 1^ 5' RACE-PCR was performed under the following conditions: initial 
denaturation at 94''C for 1 min; 35 cycles of 94*'C for 30 sec; 68°C for 2 min; followed by a 
final elongation at 6%^C for 7 min. After the 1**-PCR products were diluted, strong product 

25 bands were detected by Nested PCR with nested reverse primers (5'- 

CCAGGGTTCGGAAGCTGAGAGGAGCAGTTT-3') (SEQ ID No.l 8). Nested-PCR was 
performed under the following conditions: initial denaturation at 94°C for 1 min; 2 cycles of 
94'C for 30 sec, 72*C for 2min, and 2 cycles of 94'C for 30 sec, 70*C for 2min, 15 cycles of 
94*'C for 30 sec, for 2min; followed by a final elongation at 68^ for 7 min. Sequences 

30 were identified using Gel extraction kit (Qiagen) and TOPO TA cloning kit (hivitrogen), 
following the manufacturer's instruction. 
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Constniction of expression vectors: 

The entire coding sequences of A5657, B9769 and C7965 cDNA were amplified by 
PGR using KOD-Plus DNA polymerase (Toyobo, Osaka, Japan) with primers 
A5657 forward; 5'-CCGGAATTCATGCAGAGAGCTTCACGTCTG-3' (SEQ ID No. 19) 
S and 

A5657 reverse; 5'-CCGCTCGAGAACATCAGGATGAAATTTCTTTTC-3' (SEQ ID 
No.20), 

primers B9769-forward; 5'- CCGGAATTCATGAGCGGTGCGGGGGTGGCG -3' (SEQ ID 
No.21) and 

10 B9769-reverse; 5'- CCGCTCGAGAAGCACTGAGCGATGCAGGCG -3' (SEQ ID No.22) 
and 

primers C7965-forward; 5'- CCGGAATTCATGGACGCAGAGCTGGCAGAGGTGCG-3' 
(SEQIDNo.23)and 

C7965-reverse; 5'-CCGCTCGAGGTTGTTCTCCTCTGCACAAAG-3' (SEQ ID No.24). 

15 The PGR products were inserted into the EcoRI and Xhol sites of pGAGGS-3xFLAGn and 
pCAGGSnHA, pcDNA3.1(+)-Myc/His (Invitrogen) expression vectors, respectively. 
For ubiquitin binding assay, the HA-ubiquitin expression vector (pcdef3-HA-RPS27A) was a 
gift from Dr. Kohei Miyazono (The Cancer Institute of Japanese Foimdation for Gancer 
Research). These constructs (pCAGGS- A5657-HA, pcDNA3.1- A5657-Myc/His, pGAGGS- 

20 3xFLAG-A5657, and pcdeG-HA-RPS27A(Ub80a)) were confirmed by DNA sequencing. 
For cell growth promoting assay, the entire coding sequence of mutant H-ras (G12K) was 
constructed by Dr. Motoko Unoki. 

Immmocvtochemical stainins: 
25 To examine the sub-cellular localization of the A5657, B9769 and G7965 proteins, 

T47D cells were seeded at 5x10^ cells per well for A5657 and B9769, and G0S7 cells at 

1x10^ (low density) and 1x10^ (high density) for B9769 and C7965. After 24 hours, the cells 
were transiently transfected with Ijig of pGAGGS-A5657-HA into T47D cells using 
FuGENE 6 transfection reagent (Rodie) according to the manu&cturer*s instructions, 
30 respectively. Then, the cells were fixed witii PBS containing 4% paraformaldehyde for 1 5 
min, and rendered permeable with PBS containing 0.1% Triton X-100 for 2.5 min at 4°C. 
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Subsequently the cells were covered with 3% BSA in PBS for 12 hours at 4°C to block non- 
specific hybridization. Next, A5657-HA-transfected T47D cells were incubated with a 
mouse anti-HA antibody (SANTA CRUZ) at 1 : 1000 dilution or a mouse anti-myc antibody 
(Sigma) at 1 : 1000 dilution, respectively. After washing with PBS, both transfected-cells were 

5 stained by an Alexa594-conjugated anti-mouse secondary antibody (Molecular Probe) at 
1:5000 dilution. Moreover, the sub-cellular localization between B9769 and other 
cytoskelton proteins was compared. After transfection with 1 jig of pcDNA3. l(+)-B9769- 
myc-his, cells were with a rabbit anti-myc antibody (SANTACRUZ) at 1 :1000 dilution or a 
mouse anti-p tubulin antibody(SIGMA) at 1 :500 dilution. After washing with PBS, 

10 transfected-T47D cells were stained by an Alexa488-conjugated anti-rabbit secondary 
antibody (Molecular Probe), an Alexa594-conjugated anti-mouse secondary antibody 
(Molecular Probe) at 1:5000 dilution, Alexa594 conjugated Phalloidin at 1:50 dilution. 
Furthermore, to examine sub-cellular localization of B9769 in low density or high density of 
cells, C0S7 cells were transfected with Ijig of pCAGGSn3F-B9769-HA, and performed 

15 unmunocytochemical staining with the same procedures as above mentioned. Transfected 
cells were incubated with a mouse anti-HA antibody (SANTACRUZ) at 1 : 1000 dilution. 
After washing with PBS, C0S7 cells were stained by an Alexa594-conjugated anti-mouse 
secondary antibody (Molecular Probe) at 1 :5000 diWon. Nuclei were counter-stained with 
4',6'-diamidine-2'-phenylmdole dihydrochloride (DAPI). Furthermore, to examine sub- 

20 cellular localization of C7965 in low density or high density of cells, C0S7 cells were 

transfected with lp.g of pcDNA3.1(+)-C7965-Myc/His, and performed immunocytochemical 
staining with flie same procedures as above mentioned. Transfected cells were incubated with 
a mouse anti-myc antibody (SANTACRUZ) at 1 : 1000 dilution. After washing with PBS, 
C0S7 cells were stained by an Alexa594-conjugated anti-mouse secondary antibody 

25 (Molecular Probe) at 1 :5000 diltion. Nuclei were counter-stained with 4\6'-diamidine-2'- 
phenylindole dihydrochloride (DAPI). Fluorescent images were obtained under a TCS SP2 
AOBS microscope (Leica, Tokyo, Japan). Construction ofA5657. B9769 and C7965 specific^ 
siKNA expression vector using psiU6X3.0: 

A vector-based RNAi system was established using a psiU6BX siRNA expression 

30 vector described in the literature (Shimokawa T., Furukawa Y., Sakai M., Li M., Miwa N., 
Lin Y. M. Nakamura Y. Involvement of the FGF18 Gene in Colorectal Carcinogenesis, as a 
Novel Downstream Target of the B-Catemn/T-Cell Factor Complex63, Cancer Res., 63, 61 16- 
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20, 2003). An siRNA expression vector againsty455J7 (psiU6BX-A5657), B9769 (psiU6BX- 
B9769) and C7965 (psiU6BX- C7965) were prepared by cloning of double-stranded 
oligonucleotides into the Bbsl site in the psiU6BX vector. A control plasmid, psiU6BX- 
EGFP, was prepared by cloning double-stranded oligonucleotides of 
5 5'-CACCGAAG CAGCACGACTTCTTCTTCAAGAGAGAAGAAGTCGTGCTGCTTC-3' 
(SEQIDNo.25)and 

5'- AAAAGAAGCAGCACGACTTCTTCTCTCTTGAAGAAGAAGTCGTGCTGCTTC -3' 
(SEQ ID No.26) into the Bbsl site in the psiU6BX3.0 vector. 



Table. 1 





target sequense 


SEQ ID No. 


siEGFP 


5'-GAAGCAGCACGACTTCTT-3' 


27 


A5657 si2 


5'-CATCGCAACTGTGTTGACC-3' 


28 


A5657 si3 


5'-TGCCAGACAGTGGACAGAG-3' 


29 


B9769 sil 


5*-GCCTGCAGTTCCTGCAGCA-3' 


30 


B9769 si2 


5'-GCTTCCAGTCTGTCAAGTC-3' 


31 


B9769 si4 


5'-AGCAGAGGCCTCTAATGCA-3' 


32 


C7965 sil 


5'-ACTGCTCCTCTCAGCTTCC-3' 


33 


C7965 si3 


5'-GTACGCTTACTGGCATCAA-3' 


34 



10 Gene-Silencing effect ofA5657, B9769 and C7965: 

Human breast cancer cells line, T47D, was plated onto lO-cm dishes (1 X 10^ 
cells/dish) and transfected with psiU6BX-EGFP as negative control, psiU6BX-A5657 using 
FuGENE6 reagent according to the supplier's recommendations (Roche). Total RNA was 
extracted from the cells at 7 days after the transfection, and then the knockdown effect of 

15 siRNAs was confirmed by semi-quantitative RT-PCR using specific primers for A5657, 

B9769 and C7965, and for GAPDHas above mentioned. Moreover, transfectants expressing 
siRNAs using T47D cell lines were grown for 28 days in selective media containing 0.7 
mg/ml of neomycin. After fixation with 4% paraformaldehyde, transfected cells were stained 
with Giemsa solution to assess colony formation. MTT assays were performed to quantify 

20 cell viabiUty. After 7 days of culture in the neomycin-containing medium, MTT solution (3- 
(4,5-dimethylthiazol-2-yl)-2,5-diphenyl tetrazolium bromide) (Sigma) was added at a 
concentration of 0.5 mg/ml. Following incubation at 37*^C for 2.5 hours, acid-SDS (0.0 IN 
HC1/10%SDS) was added; the suspension was mixed vigorously and then incubated overnight 
at 37^C to dissolve the dark blue crystals. Absorbance at 570nm was measured with a 

25 Microplate Reader 550 (BioRad). 
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Cell Proliferation Assays: 

NIH3T3 cells were seeded at 3x10^ cells per 150mm dish. After 24 hours, 16 of 
pcDNA3.1(+)-Myc/His (Invitrogen) as anegative control, pcDNA3.1(+)-A5657-Myc/His, 
5 pcDNA3.1(+)-B9769-Myc/ffis, pcDNA3.1(+)-C7965 or pcDNA3.1(+)-H-Ras-miitant- 
Myc/His as a positive control were transiently transfected using FuGENE 6 (Roche) into 
NIH3T3 cells, respectively. Next, 2 x lO'' of NIH3T3 cells per well were re-seeded at 24 
hours after transfection. MTT assays were performed at 1, 2, 4 and 6 days after re-seedling as 
described previously. 

10 

Ubiquitin bin(Sng analysis 

COS? cells were seeded at 5x1 0^ cells per well. After 24 hours, 1 jig of FLAG- 
tagged A5657 and 0.1 ng of HA-tagged-ubiquitin expression vector (Ub80a-HA) were 
cotransfected using FuGE]SIE6 (Roche) into C0S7 cells. The HA-tagged-ubiquitin expression 

15 vector was constructed according to Imamura et al. and Ebisawa et al (Imamura et al., Nuture 
389/6651, 622-6, 1997 "Smad6 inhibits signalling by the TGF-beta super family"; Ebisawa et 
al., J Biol Chem. 20:276/16, 12477-80, 2001 "Smurfl interacts with transforming growth 
factor-beta type I receptor through Smad7 and induces receptor degradation"). The cells were 
added 5\M proteasome inhibitor, MG132 or SfiM DMSO at 24 hours after transfection, and 6 

20 hour later, the cells were lysed in 1 ml of 1 .0 % NP40 buffer (1 .0 % NP40, 50 mM Tris-HCl 
(pH 8.0), 150 mMNaCl, and complete protease-inhibitor cocktail). The cell lysates were 
immunoprecipitated with agarose conjugated with mouse monoclonal anti-FLAG M2- 
antibody or mouse anti-HA antibody. After six rounds of washing with lysis buffer, the 
binding proteins were eluted with Flag-M2 or HA peptides (SIGMA). The 

25 immunoprecipitates were analysed on Westem blots using rabbit anti-HA polyclonal antibody, 
or rabbit anti-Flag polyclonal antibody. The soluble proteins in sample buffer were loaded on 
12% SpS-polyacrylamide gels and transferred to nitrocellulose membranes (Hybond ECL6). 
The membranes were blocked with BlockAce powder (Yukijirushi, Tokyo, Japan) in Tris- 
buffered saline containing 0.05% Tween 20 (TBST) and incubated with each antibody for 1 

30 hour at room temperature. Hie blots were then hybridized with HRP-conjugated secondary 
antibody (Amershann Biosciences), and detected using the ECL method (Amersham 
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Biosciences). 

Results 

Identification ofA5657, B9769 and C7965 as up-regulated genes in breast cancer cells: 
When gene-expression profiles of cancer cells firom 81 breast cancer patients were 

5 analyzed using a cDNA microarray representing 23,040 human genes, 102 genes that were 
conjmonly up-regulated were identified in breast cancer cells. From these up-regulated genes, 
the genes with m-house code A5657, which designated HSPC150 protein similar to ubiquitin- 
conjugating enzyme (Genbank Accession NM_014176) (SEQ ID NO: 1), with B9769, which 
designated hypothetical protem BC016861 (Genbank accession No. NM_138770) (SEQ ID 

10 NO: 3), and with C7965, which corresponded to EST (SEQ ID NO: 5) were selected. 

Expression of the genes A5657, B9769 and C7965 were elevated in 38 of 49, 30 of 73 and 28 
of 49 breast cancer cells on the microarray in comparison with normal breast ductal cells, 
respectively. To confirm the expression of these up-regulated genes we focused, semi- 
quantitative RT-PCR analysis was performed to compare the expression of them between 

15 breast cancer cells and normal human tissues including normal breast cells. Fkstly, it was 
discovered that A5657 showed the elevated expression in 9 of 12 clinical breast cancer 
samples (poorly-differentiated type) as compared to normal breast ductal cells and some other 
normal tissues, and was over-expressed in all of 6 breast cancer cell lines as well (Figure la). 
Next, is was discovered that B9769 showed the elevated expression in 6 of 12 clinical breast 

20 cancer samples (poorly-differentiated type) as compared to normal breast ductal cells, and 
was over-expressed in 4 of 20 breast cancer cell lines (Figure lb). Finally, it was revealed 
that C7965 showed the elevated expression in 7 of 12 clinical breast cancer samples (well- 
differentiated type) compared to normal breast ductal cells, and was over-expressed in 15 of 
20 breast cancer cell lines (Figure Ic). 

25 To fijrther examine the expression pattern of these up-regulated genes. Northern blot 

analysis was performed with multiple-human tissues and breast cancer cell lines using cDNA 
fragments of A5657, B9769 and C7965 as probes (see above). As a result, A5657 was found 
to be ubiquitously expressed except lung, liver, pancreas and peripheral blood leukocytes 
(Figure 2a; the upper panel), while was surprisingly over-expressed in all of breast cancer cell 

30 lines as compared to other normal tissues, especially heart and bone marrow, each of which 
showed strong signals as shown in the upper panel of Figure 2a (Figure 2a; the bottom panel). 
B9769 was exclusively expressed in testis and prostate (Figure 2b, the upper panel) and was 
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found to be over-expressed in some of breast cancer cell lines as compared to other normal 
tissues, especially nomial human breast (Figure 2b, the bottom panel). When Northern blot 
analysis was performed with mutiple-tissues and breast cancer cell lines using a C7965 
fragment within exon2 and 3 as a probe, approximately 1 .35 kb transcripts were observed. 
5 The transcript of 1 .35 kb was specijScally expressed in breast cancer cell lines than in normal 
tissues including breast tissue (Figure 2c, the bottom panel), and only weakly expressed in 
testis , skeletal muscle and small intestine (Figure 2c, the upper panel). . 

Genomic structure ofA5657. B9769 and C7965: 

10 To obtain the entire cDNA sequences for A5657, B9769 and C7965, RT-PCR was 

performed using a cDNA prepared from a breast cancer cell line and T47D as template. 
A5657 consists of 7 exons, designated HSPC150 protein similar to ubiquitin-conjugating 
enzyme, located on the chromosome lq32.1 spanning approximately 10.3 kb in the genome. 
The fiill-length cDNA sequence of A5657 contained 928 nucleotides. The open reading frame 

15 (ORF) start at exon 2, and ends at exon 7. Eventually, this transcript encodes 197 amino acids. 
B9769 consists of 8 exons, located on the chromosome 2q21.2 spanning 
approximately 5.7kb in the genome. The fiiU-length cDNA sequence of B9769 contained 
1472 nucleotides. The ORF start at exon 1, and the ends at exon 8. Eventually, this transcript 
encodes 378 amino acids. 

20 To further isolate the 5' end of the C7965 transcript, rapid amplification of cDNA 

ends (RACE) was performed using a cDNA prepared from a breast cancer cell line, MCF7, as 
a template (see material and methods). A transcript was isolated consisting of 8 exons, 
corresponding to 1.35kb in Northem blot (Figure 2c, the bottom panel), respectively, and 
located on the chromosome 9q spanning approximately 28.8 kb in the genome. The fiill- 

25 length cDNA sequence of C7965 contained 1315 nucleotides. The ORF of the C7965 cDNA 
start within exon 1 and ends at exon 8. Eventually, this transcript encodes 288 amino acids. 

Subcellular localization ofA5657 B9769 and C7965: 

To fiirther examine the characterization of A5657, B9769 and C7965, the sub-cellular 
30 localization of these gene products was examined in mammalian cells. Firstly, when we 
transiently transfected plasmids e)q)ressmg the A5657 protein (pCAGGS-A5657-HA) were 



wo 2005/029067 



PCT/JP2004/011741 



-69- 

transiently transfected into T47D cells, immunocytochemical staining revealed exogenous 
A5657 localized throughout the cytoplasm in approximately 80% of all transfected-T47D 
cells and nucleus in the remaining approximately 20% of cells (Figure 3a). 

Next, when a plasmid expressing the B9769 protem (pCAGGS-Flag-B9769-HA) was 

5 transiently transfected into C0S7 cells, immunocytochemical staining revealed that the 
B9769 protein was localized to the cytoplasmic apparatus as the intemiediate filaments in a 
small number of cells (low density), but localized to the cytoplasm in a large number of cells 
(high density) (Figure 3b), suggesting that B9769 may play a key role of interaction of cell to 
cell. To further examine the localization of this protein in more detail, localization of other 

10 cytoskeletal proteins was compared by immunocytochemical staining. As a result, B9769 and 
other cytoskeletal proteins were not co-localized to the cytoplasmic apparatus as the filaments 
in T47D cells (Figure 3c). 

Finally, a plasmid expressing C7965 protein (pcDNA3.1(+)-C7965-Myc/His) was 
transiently transfected into C0S7 cells, immunocytochemical staining reveals the C7965 
15 protein localized to the cytoplasmic apparatus in C0S7 cells (Figure 3d), 

Growth-inhibitory effects of small-interfering RNA (siRNA) designed to reduce expression of 
A565ZB9769andC7965: 

To assess the growth-promotmg role of A5657, B9769 and C7965, tiie expression of 

20 endogenous A5657 and B9769 in the breast cancer Ime T47D, a line that has shown the over- 
expression of A5657, was knock down by means of the mammalian vector-based RNA 
interference (RNAi) technique (see Materials and Methods) (Figure 4, 5 and 6), Ejqpression 
levels of A5657, B9769 and C7965 were examined by semi-quantitative RT-PCR 
experiments. As shown in Figure 4, 5 and 6, A5657 (si2 and si3), B9769 (sil and si2, si4), 

25 and C7965 (sil and si3) -specific siRNAs significantly suppressed expression of each gene 
compared with control siRNA construct (psiU6BX-EGFiP). To confirm the cell growth 
inhibition with A5657, B9769 and C7965-specific siRNAs, colony-formation and MTT 
assays were performed, respectively. As a result, introduction of A5657 (si2 and si3) (Figure 
4b,c), B9769 (sil and si2, si4) (Figure 5b,c) and C7965 (sil and si3) (Figure 6b,c) constructs 

30 suppressed growth of T47D cells, consistent to the result of above reduced expression. Each 
result was verified by three independent experiments. Thus, these findings suggest that 
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A5657, B9769 and C7965 have a significant function in the cell growth of the breast cancer. 

Growth promoting effect of transient over-expressing A5657, B9769 and C7965 into NIH3T3 
cells: 

5 To further examine a possible role of A5657, B9769 and C7965 in the cell growth 

regulation, plasmids expressing A5657, B9769, C7965, H-ras mutant as positive control or 
Mock as negative control were transientiy transfected into NIH3T3 cells, respectively, and 
then performed MTT assay (Figure 7). After 6 days of transfection, over-expression of 
A5657 (Figure 7a), B9769 (Figure 7b) and C7965 (Figure 7c) produced significant 

10 enhancement of cell growth (over 3-fold) as well as over-expression of H-ras mutant 

compared with NIH3T3 cells transfected with control plasmid (mock vector), suggesting that 
A5657, B9769 and C7965 might play roles in proliferation of cell growfli. These results were " 
confirmed by two independent experiments. 

15 A5657 protein binds ubiquitin: 

To explore the function of A5657 in more detail, Westem blot analysis usmg whole 
cell lysates firom A5657 expressing plasmid-transfected C0S7 cells was performed. As a 
result, only one extra slower migrating band was observed in addition to the band showing the 
expected molecxilar weight OPigure 8; left panel, in whole cell lysate), suggesting this might 

20 be generated by posttranslational modification. SMART program predicted A5657, encoded 
197 amino-acid, would contain UBCc domain (Ubiquitin-conjugating enzyme E2, catalytic 
domain homologues) (5-1 52 residue), suggesting that A5657 might have a potential E2 
ubiquitin enzyme activity. To investigate whether this extra band was due to mono- 
ubiquitination, plasmid DNAs designed to express FLAG-tagged A5657 (A5657-FLAG) and 

25 HA-tagged ubiquitin (UB80a-HA) were co-transfected into C0S7 cells. A5657 was 

immunoprecipitated using anti-FLAG antibody and the precipitate was detectable with anti- 
HA antibody. An extra slow migrating band was observed, indicating that A5657 binds an 
ubiquitin (Figure 8). Moreover, when immunopresipitation was performed using anti-HA 
antibody and then Westem blot analysis with anti-FLAG, an extra slow migrating band was 

30 also observed as well (Figure 8). Treatment of MG132, proteaseome inhibitor, had no effect 
on this binding. This findmg strongly suggests that A5657 protein might have E2 ubiquitin 
enzyme activity via mono-ubiquitination. 
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DlSCUSSION 

Herein, through the precise expression profiles of breast cancer by means of genome 
wide cDNA microarray, novel genes, A5657, B9769 and were isolated that were 

5 significantly over-expressed in breast cancer cells as compared to normal human tissues. 

Among them, A5657 contains a UBCc domain (ubiquitin-conjugating enzyme E2, 
catalytic domain homologues) and, accordingly, may bind ubiquitin. This finding strongly 
suggests that the A5657 protein has E2 ubiquitm enzyme activity via mono-ubiquitination, 
and, accordingly, may be involved in breast cancer tumorigenesis. 

10 The B9769 protein was observed by immunochemical staining to localize in 

cytoplasm as intermediate filaments. The B9769 protein was further observed to localize in 
the cytoplasmic apparatus as the intermediate filaments under conditions of low density cell 
population, but in the cytoplasm xmder conditions of high density cell population, which 
suggests that B9769 may play a key role in cell to cell interaction. 

15 As demonstrated herein, treatment of breast cancer cells with siRNA ejffectively 

mhibited expression of all three target genes, A5657, B9769 and C7965, which, in turn, 
significantly suppressed breast cancer cell-tumor growth. Moreover, these genes when 
transiently over-expressed in NIH3T3 cells dramatically were demonstrated to promote cell 
proliferation in MTT assay. These findings suggest that A5657, B9769 and C7965 play key 

20 roles in tumor cell growth proliferation and are promising targets for development of anti- 
cancer drugs. 
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Industrial Applicability 

The gene-expression analysis of breast cancer described herein, obtained through a 
5 combination of laser-capture dissection and genome-wide cDNA microarray, has identified 
specific genes as targets for cancer prevention and therapy. Based on the expression of a 
subset of these differentially expressed genes, the present invention provides molecular 
diagnostic markers for identifying or detecting breast cancer. 

The methods described herein are also useful in the identification of additional 
10 molecular targets for prevention, diagnosis and treatment of breast cancer. The data reported 
herein add to a comprehensive understanding of breast cancer, facilitate development of novel 
diagnostic strategies, and provide clues for identification of molecular targets for therapeutic 
drugs and preventative agents. Such information contributes to a more profound 
understanding of breast tumorigenesis, and provide indicators for developing novel strategies 
15 for diagnosis, treatment, and ultimately prevention of breast cancer. 

All patents, patent applications, and publications cited herein are incorporated by 
reference in their entirety. 
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Furthennore, while ihe invention has been described in detail and with reference to 
specific embodiments thereof, it is to be understood that the foregoing description is 
exemplary and explanatory in nature and is intended to illustrate the invention and its 
preferred embodiments. Through routine experimentation, one skilled in the art will readily 
5 recognize that various changes and modifications can be made therein without departing firom 
the spirit and scope of the invention. Thus, the invention is intended to be defined not by the 
above description, but by the following claims and their equivalents. 



